O'Reilly logo

Analytics and Tech Mining for Engineering Managers by Jan H. Kwakkel, Scott W. Cunningham

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

         CHAPTER 5         

PARSING COLLECTED DATA

In this chapter, we dive into the heart of tech mining by parsing records and storing them in a format for easy retrieval and analysis. Cleaning, parsing, and filtering the data is a major task in tech mining, which often requires a lot of effort. Because of this, this chapter is the first of three chapters on the general topic of cleaning the data.

As discussed in the previous chapter, records of scientific articles come in a variety of formats. In this chapter, we will discuss row- and column-structured examples in particular. These examples build upon, and expand, the Python basics discussed in Chapter 3. In this chapter dictionaries are applied to storing and structuring real examples of ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required