Reference 1: Python Interview Questions
Reference 2: Python tips

Key Features

Interpreted language

An interpreted language does not need a separate compile step before it is run, e.g. Python, PHP, Ruby.

Dynamically typed language

You do not need to declare the type of a variable, as you do in C or Java. Types are attached to objects and checked at runtime.

OOP

An object-oriented programming language, as opposed to procedure-oriented programming. Related concepts: class, object, method, constructor.

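Inheritance

Base class (parent class) -> derived class (child class)
Types of inheritance:
- Single inheritance: one child class inherits from one base class
- Multi-level inheritance: base1 -> d1 -> d2
- Hierarchical inheritance: any number of child classes inherit from one base class
- Multiple inheritance: a child class inherits from multiple parent classes
Call mro() on a class to view its method resolution order (MRO), i.e. the order in which classes are searched when an attribute or method is looked up.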
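
As a minimal sketch of multiple inheritance and the resolution order reported by mro(), consider the diamond below. The class names (Base, Left, Right, Child) and the method who() are hypothetical and chosen only for illustration.

```python
class Base:
    def who(self):
        return "Base"


class Left(Base):
    def who(self):
        return "Left"


class Right(Base):
    def who(self):
        return "Right"


class Child(Left, Right):
    # Defines nothing itself, so lookups fall through to the parents
    # in the order given by the MRO.
    pass


mro_names = [cls.__name__ for cls in Child.mro()]
print(mro_names)      # ['Child', 'Left', 'Right', 'Base', 'object']
print(Child().who())  # Left (the first class in the MRO that defines who)
```

Swapping the parents to class Child(Right, Left) would put Right ahead of Left in the MRO.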

Polymorphism

Say a parent class has a method foo(). A child class can define a method with the same name but its own implementation (method overriding), so the same call behaves differently depending on the class of the object it is made on.
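
A small sketch of a child class redefining a parent's method may help here. The classes Animal, Dog, and Cat and the method speak() are made up for this example.

```python
class Animal:
    def speak(self):
        return "..."


class Dog(Animal):
    def speak(self):  # same name as the parent's method, different behavior
        return "Woof"


class Cat(Animal):
    def speak(self):
        return "Meow"


# The caller uses one interface; each object supplies its own behavior.
sounds = [animal.speak() for animal in (Animal(), Dog(), Cat())]
print(sounds)  # ['...', 'Woof', 'Meow']
```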

Encapsulation

Binding code and data together. A Python class is an example of encapsulation.

Class Variables vs. Instance variables

Class variables are shared between all instances of a class.
Instance variables are unique to each object of that class.
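
The difference can be sketched with a hypothetical Counter class (the names Counter, total, count, and bump are illustrative only): total lives on the class and is shared, while count lives on each instance.

```python
class Counter:
    total = 0  # class variable: one value shared by every instance

    def __init__(self, name):
        self.name = name  # instance variables: one value per object
        self.count = 0

    def bump(self):
        self.count += 1     # changes only this object
        Counter.total += 1  # changes the shared value


a = Counter("a")
b = Counter("b")
a.bump()
a.bump()
b.bump()

print(a.count, b.count)  # 2 1
print(Counter.total)     # 3
print(a.total, b.total)  # 3 3 (instances read the shared class variable)
```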

Memory management

Python private heap space: all Python objects and data structures are stored in a private heap that's managed by Python's memory manager. Programmers do not have access to this private heap.
Python has a built-in garbage collector that reclaims unused memory so that it can be made available to the heap space again.

Important Concepts

Namespace

Reference

A naming system to organize symbolic names assigned to objects in Python.

Four different kinds of namespaces in Python:

Built-in. Created when the Python interpreter starts, ends when the interpreter terminates.
Global. Created when the main program body (or a module) starts executing, ends when the interpreter terminates.
Local. Created when a function is called, ends when the function returns.
Enclosing. The local namespace of an outer function, as seen from a function defined inside it (a nested function). It ends when the outer function returns, unless the nested function keeps it alive as a closure.

When a name is referenced, Python searches for it by the LEGB rule (Local -> Enclosing -> Global -> Built-in).
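
The LEGB lookup order can be illustrated with a short sketch. All function and variable names below are hypothetical.

```python
x = "global"


def outer():
    x = "enclosing"

    def inner_with_local():
        x = "local"
        return x  # found in the Local namespace

    def inner_without_local():
        return x  # not local, so found in the Enclosing namespace

    return inner_with_local(), inner_without_local()


def uses_global():
    return x  # not local or enclosing, so found in the Global namespace


def uses_builtin():
    return len("abc")  # len is found last, in the Built-in namespace


results = (outer(), uses_global(), uses_builtin())
print(results)  # (('local', 'enclosing'), 'global', 3)
```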

PYTHONPATH

An environment variable that is used when importing modules. The directories listed in PYTHONPATH are added to sys.path, so the interpreter searches them for imported modules in addition to its default locations.

Global variables vs. Local variables

Declared outside of a function or in global space vs. declared inside a function

Attributes

Instance attributes: unique to each object
Class attributes: defined on the class and shared by all of its instances

args and kwargs

*args for passing in an unknown number of positional arguments (collected into a tuple)
**kwargs for passing in an unknown number of keyword arguments (collected into a dictionary)
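
A brief sketch of both directions: collecting arguments in a function definition, and unpacking a list and a dictionary at the call site. The functions describe() and add() are hypothetical.

```python
def describe(*args, **kwargs):
    # args is a tuple of the positional arguments,
    # kwargs is a dict of the keyword arguments.
    return args, kwargs


positional, keyword = describe(1, 2, 3, sep="-", end="!")
print(positional)  # (1, 2, 3)
print(keyword)     # {'sep': '-', 'end': '!'}


def add(a, b, c=0):
    return a + b + c


# The same operators unpack a sequence or a mapping when calling a function.
total = add(*[1, 2], **{"c": 3})
print(total)  # 6
```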

Dunder methods

"Dunder" is short for "double underscore": dunder methods have names that begin and end with two underscores.
Commonly used for operator overloading
A few examples:
- __init__: called whenever an instance of a class is initialized
- __getitem__: returns the result of accessing the instance using the [] (indexer) operator.
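
As an illustration, a hypothetical Vector class (not from any library) can define a few dunder methods to support construction, indexing, the + operator, and len().

```python
class Vector:
    def __init__(self, *components):
        # Runs when Vector(...) is called to initialize the new instance.
        self.components = list(components)

    def __getitem__(self, index):
        # Runs for vector[index].
        return self.components[index]

    def __add__(self, other):
        # Runs for vector + other (operator overloading).
        pairs = zip(self.components, other.components)
        return Vector(*(a + b for a, b in pairs))

    def __len__(self):
        # Runs for len(vector).
        return len(self.components)


v = Vector(1, 2) + Vector(3, 4)
print(v[0], v[1], len(v))  # 4 6 2
```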

Pass by assignment

Reference

Pass by reference: the argument passed to the function is a reference to a variable that already exists in memory, instead of an independent copy of that variable.
Pass by assignment: when you pass an argument to a function, the parameter name is bound to the object the argument refers to. If an object representing the value (e.g. the int 2) already exists, the name is bound to that object and the object's reference count is incremented. An entry is added to the function's namespace to bind the name to the object.

Effect: when multiple names are assigned to the same object, Python increments the reference count for that object instead of creating duplicate copies of it. Mutating the object through one name is therefore visible through the others, while rebinding a name to a new object is not.
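
The practical consequence can be sketched with two hypothetical functions: one mutates the object it receives, the other only rebinds its local parameter name.

```python
def mutate(items):
    items.append(4)  # changes the object that the caller's name also refers to


def rebind(items):
    items = [0]      # binds the local name to a new object; the caller is unaffected
    return items


data = [1, 2, 3]
alias = data         # a second name for the same object, no copy is made

mutate(data)
rebound = rebind(data)

print(data)           # [1, 2, 3, 4]
print(alias is data)  # True
print(rebound)        # [0]
```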

Multithreading

Reference: Concurrency in Python

Thread

A thread is the smallest unit of execution that an OS can schedule.
Five states (Life cycle) of a thread:
- New thread. Not yet started, no resources have been allocated.
- Runnable. Resources have been allocated, but the task scheduler has not yet scheduled it to run.
- Running. Executes tasks.
- Not running/waiting. Thread is paused because it's either waiting for the response of some I/O request or waiting for the completion of execution of another thread.
- Dead. Thread terminated.

Multi-threading in Python

Dividing a process into multiple threads that share the same address space. Code and data are shared among threads. Each thread has its own registers, program counter, and stack.
While these threads could in principle run on different CPU cores in parallel to achieve speedups, the standard CPython interpreter prevents this through its Global Interpreter Lock (GIL). The GIL makes sure that only one thread executes Python bytecode at any given time. The GIL also adds overhead, so for CPU-bound work the total running time can actually increase with multithreading.
Benefits:
- When a thread is executing a blocking operation, such as reading from or writing to a file, another thread can run in the meantime.
- Sharing of data
Synchronization: necessary to prevent two threads from modifying a shared resource at the same time. Python provides lock mechanisms (e.g. threading.Lock) for this.
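
A minimal sketch of synchronizing access to shared data with the standard library's threading.Lock. The names counter and add_many, and the thread and iteration counts, are arbitrary choices for this example.

```python
import threading

counter = 0
lock = threading.Lock()


def add_many(n):
    global counter
    for _ in range(n):
        # Only one thread at a time may hold the lock, so the
        # read-modify-write of the shared counter is never interleaved.
        with lock:
            counter += 1


threads = [threading.Thread(target=add_many, args=(10_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()  # wait for every thread to finish

print(counter)  # 40000
```

Without the lock, two threads could read the same old value of counter and one of the increments could be lost.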

Concurrency vs. Parallelism

Concurrency is when two or more tasks are in progress over the same period of time. They may be interleaved on a single core rather than literally executing at the same instant.
Parallelism is when a task is broken into subtasks that are executed simultaneously. The subtasks are distributed among different cores of a single CPU or among multiple computers connected within a network.

Types & Data Structures

List

Mutable, ordered, can hold different data types

Array

Mutable, ordered, holds a single data type (e.g. array.array from the standard library)

Tuple

Immutable, ordered, can hold different data types

Set

Mutable, unordered, cannot store duplicate items

Dictionary

Keys must be unique and hashable; insertion order is preserved as of Python 3.7
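
The properties listed above can be checked with a short sketch using only built-in types and the standard library's array module. The variable names are illustrative.

```python
from array import array

mixed_list = [1, "two", 3.0]  # a list can hold different data types
mixed_list.append(4)          # and is mutable

typed = array("i", [1, 2, 3])  # an array holds a single type (here: signed int)
try:
    typed.append(1.5)
    array_rejected_float = False
except TypeError:
    array_rejected_float = True

point = (1, "x")  # a tuple can hold different types but is immutable
try:
    point[0] = 2
    tuple_is_immutable = False
except TypeError:
    tuple_is_immutable = True

unique = set([1, 1, 2, 3, 3])  # duplicates are dropped

d = {"b": 1, "a": 2}
d["c"] = 3
key_order = list(d)  # keys come back in insertion order

print(array_rejected_float, tuple_is_immutable)  # True True
print(unique == {1, 2, 3})                       # True
print(key_order)                                 # ['b', 'a', 'c']
```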

Numpy

Ravel vs. Flatten

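a1 = a.flatten() gives the same values as a2 = np.ravel(a), except that flatten() always returns a new array (a copy) while ravel() returns a view of the original array whenever possible. Thus, changing a2 may change a as well. Both accept an order argument: for example, a.flatten('F') flattens in column-major order instead of the default row-major ('C') order.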
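
A small sketch of the copy-versus-view behavior, assuming NumPy is installed and a is a contiguous array (the case in which ravel() can return a view). The variable names are illustrative.

```python
import numpy as np

a = np.array([[1, 2], [3, 4]])

flat_copy = a.flatten()  # always a new array
flat_view = np.ravel(a)  # a view here, because a is contiguous

flat_copy[0] = 100       # does not touch a
flat_view[1] = 200       # writes through to a

print(a.tolist())                      # [[1, 200], [3, 4]]
print(np.shares_memory(a, flat_copy))  # False
print(np.shares_memory(a, flat_view))  # True

# order='F' reads the array column by column instead of row by row.
col_major = a.flatten("F")
print(col_major.tolist())              # [1, 3, 200, 4]
```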

Squeezing & Expanding

np.squeeze reduces array dimensions by dropping axes that are of unit length. E.g. an array of shape (1,4,1,5) becomes shape (4,5) after calling a = np.squeeze(a). You can restrict the operation to a specific axis by passing in the axis parameter.
np.expand_dims(a, axis = 1) is the opposite operation to np.squeeze(a, axis = 1).
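
The shapes involved can be checked with a quick sketch, assuming NumPy is installed. The array contents are irrelevant here, so zeros are used.

```python
import numpy as np

a = np.zeros((1, 4, 1, 5))

squeezed = np.squeeze(a)               # drops every unit-length axis
squeezed_axis0 = np.squeeze(a, axis=0) # drops only axis 0
print(squeezed.shape)        # (4, 5)
print(squeezed_axis0.shape)  # (4, 1, 5)

b = np.zeros((4, 5))
expanded = np.expand_dims(b, axis=1)     # inserts a unit-length axis at position 1
restored = np.squeeze(expanded, axis=1)  # removes it again
print(expanded.shape)  # (4, 1, 5)
print(restored.shape)  # (4, 5)
```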

Concatenate vs. Stack

np.concatenate((arr1, arr2), axis = 0) requires arrays to have the same shape except along the concatenation axis. The return value has the same number of dimensions as the input arrays.
np.stack((arr1, arr2), axis = 0) requires arrays to have exactly the same shape. The return value has one more dimension than the input arrays (the position of the new dimension is specified with the axis parameter).
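
A short sketch comparing the resulting shapes, assuming NumPy is installed. The array names and sizes are arbitrary.

```python
import numpy as np

arr1 = np.ones((2, 3))
arr2 = np.zeros((4, 3))  # differs from arr1 only along axis 0
arr3 = np.zeros((2, 3))  # exactly the same shape as arr1

# concatenate joins along an existing axis; the number of dimensions is unchanged.
joined = np.concatenate((arr1, arr2), axis=0)
print(joined.shape)    # (6, 3)

# stack creates a new axis at the requested position.
stacked0 = np.stack((arr1, arr3), axis=0)
stacked2 = np.stack((arr1, arr3), axis=2)
print(stacked0.shape)  # (2, 2, 3)
print(stacked2.shape)  # (2, 3, 2)
```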

Repeat

np.repeat(arr, 3, axis = 1): repeat each element 3 times along axis 1
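
For instance, on a small 2x2 array (assuming NumPy is installed; the values are arbitrary):

```python
import numpy as np

arr = np.array([[1, 2], [3, 4]])

# Each element is repeated 3 times in place along axis 1 (the columns).
repeated = np.repeat(arr, 3, axis=1)
print(repeated.tolist())  # [[1, 1, 1, 2, 2, 2], [3, 3, 3, 4, 4, 4]]
print(repeated.shape)     # (2, 6)
```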

Broadcasting

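Makes numeric operations possible between two arrays of different but compatible shapes. Shapes are compared from the trailing dimension backwards, and each pair of dimensions must either be equal or one of them must be 1.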
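
A brief sketch of broadcasting a row and a column against a 2x3 matrix, assuming NumPy is installed. The names matrix, row, and col are illustrative.

```python
import numpy as np

matrix = np.arange(6).reshape(2, 3)  # [[0, 1, 2], [3, 4, 5]]
row = np.array([10, 20, 30])         # shape (3,)
col = np.array([[100], [200]])       # shape (2, 1)

# row is applied to every row of the matrix.
plus_row = matrix + row
print(plus_row.tolist())  # [[10, 21, 32], [13, 24, 35]]

# col is applied to every column of the matrix.
plus_col = matrix + col
print(plus_col.tolist())  # [[100, 101, 102], [203, 204, 205]]

# Shapes (2, 3) and (2,) are not compatible: the trailing dimensions are 3 and 2.
try:
    matrix + np.array([1, 2])
    incompatible_raises = False
except ValueError:
    incompatible_raises = True
print(incompatible_raises)  # True
```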

PyTorch Tensors

Similar to numpy arrays except that they can run on GPUs or other hardware accelerators. Their computations can be parallelized to reduce runtime.

numpy to tensor: torch.from_numpy(arr) (the tensor shares memory with arr)
tensor to numpy: t1.detach().numpy() (call .cpu() first if the tensor lives on a GPU)
