Python for Big Data

Python for Big Data
Basic stack
numpy
scipy
pandas
"Python for Data Analysis" by Wes McKinney
scikits image
scikits learn
scikits statsmodels
nltk
matplotlib
Newer packages
Numba
wiseRF
Blaze
Integrated platforms
Continuum.io
Anaconda
Wakari
PiCloud
Python + AWS
wise.io
MLaaS
RandomForest
ipython
Notebook
Orange
Visualization
matplotlib
Bokeh
ggplot for python
Mayavi
Nodebox
igraph
pandas
pandas.tools.rplot
Google APIs
googleVis
Data formats
Flat text
xreadlines
readLines
pandas
read_csv
read_fwf
xlrd/xlwt/xlutils
HDF5
PyTables
h5py
SQL
SQLAlchemy
pysqlite3
pyodbc
Vertica
Netezza
Teradata
NoSQL
MongoDB
PyMongo
CouchDB
couchdb-python
couchdbkit
JSON
Standard library
json
simplejson
XML
Standard library
xml
HBase
HappyBase
MapReduce
Hadoop interface
Hadoop Streaming
Hadoopy
example
dumbo
mrjob
Pydoop
uses Hadoop Pipes
disco
Glue
rpy2
R
PySpark
Spark
ipython
magic
R
SQL
matlab/octave
IDL
Jython
Java
boto
Amazon Web Services
GPU
NumbaPro
PyCUDA
Parallel
ipython
ipcluster
pp
dispy
Efficiency
Cython
Packages
PyPI
30686 packages
102 1