Skip to Content
Python Machine Learning
book

Python Machine Learning

by Wei-Meng Lee
April 2019
Intermediate to advanced
320 pages
6h 42m
English
Wiley
Content preview from Python Machine Learning

CHAPTER 2Extending Python Using NumPy

What Is NumPy?

In Python, you usually use the list data type to store a collection of items. The Python list is similar to the concept of arrays in languages like Java, C#, and JavaScript. The following code snippet shows a Python list:

list1 = [1,2,3,4,5] 

Unlike arrays, a Python list does not need to contain elements of the same type. The following example is a perfectly legal list in Python:

list2 = [1,"Hello",3.14,True,5] 

While this unique feature in Python provides flexibility when handling multiple types in a list, it has its disadvantages when processing large amounts of data (as is typical in machine learning and data science projects). The key problem with Python's list data type is its efficiency. To allow a list to have non‐uniform type items, each item in the list is stored in a memory location, with the list containing an “array” of pointers to each of these locations. A Python list requires the following:

  • At least 4 bytes per pointer.
  • At least 16 bytes for the smallest Python object—4 bytes for a pointer, 4 bytes for the reference count, 4 bytes for the value. All of these together round up to 16 bytes.

Due to the way that a Python list is implemented, accessing items in a large list is computationally expensive. To solve this limitation with Python's list feature, Python programmers turn to NumPy, an extension to the Python programming language that adds support for large, multidimensional arrays and matrices, along with ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Python Machine Learning

Python Machine Learning

Sebastian Raschka
Python Machine Learning - Third Edition

Python Machine Learning - Third Edition

Sebastian Raschka, Vahid Mirjalili

Publisher Resources

ISBN: 9781119545637Purchase book