Update all Python packages with pip
pip list --outdated --format=freeze | grep -v '^-e' | cut -d = -f 1 | xargs -n1 pip install -U
orpip freeze --local | grep -v '^\-e' | cut -d = -f 1 | xargs -n1 pip install -U
Install package without installing the dependencies
pip install -U --no-deps mypackage
ssh USER@hadoop.rcc.uchicago.edu
login to RCC, then enter password
ls
list all directories
mkdir newdir
makes new directory named newdir
rm -rf newdir
deletes directory named newdir
clear
clears screen
cd newdir
goes into newdir
cd
goes to home
pwd
to see which file you're in
rm file
to remove 'file'
ssh user@host
connect to host as user
ctrl+c
halt current command
crtl+z
stop current command
ctrl+d
log out of current session
ctrl+w
erases one word in current line
ctrl+u
erases whole line
!!
repeat last command
exit
log out of current session
wget <url>
download file directly
mv orignalFile name newFileName
to change file name
tar zxvf instacart.tar.gz
unzip file
hadoop fs -put /home/$USER/data/instacart /user/$USER/instacart
to put it to hadoop
- Harvard Introduction to Computer Science
- Notes that accompany the Stanford CS class: Convolutional Neural Networks for Visual Recognition
- Plotting and Graphing
- Best Practices
- Principal Component Analysis
- Clustering
- Validation
- Ensemble Methods
- Gradient Descent Optimisation
- Neural Network/Deep Learning Overview and Training
- Convolutional Neural Networks and Images
- Good explainers here and here
- Feature Visualization how neural networks build up their understanding of images
- Combine the content of one image with the style of another image and another Photo Style Transfer and [another example](https://github.com/jcjohnson/neural-style/
- CNN example with MNIST dataset
- Convolution animations
- Convolutional Neural Networks for Visual Recognition
- ResNet, AlexNet, VGGNet, Inception: Understanding various architectures of Convolutional Networks
- An Overview of ResNet and its Variants
- What is wrong with Convolutional neural networks?
- Convolutional Neural Networks
- Understanding layers in NN from google AI blog
- ImageNet Classification with Deep Convolutional Neural Networks
- Pretrained Image Classifier here and here
- DeepFace: Closing the Gap to Human-Level Performance in Face Verification
- OpenFace: A general-purpose face recognition library
- Convolutions and Backpropagations
- Transfer Learning
- Natural Language Processing
- Recommender Systems
- Recurrent Neural Networks and Long Short-Term Memory (LSTM)
- Blockchain
- Big Data
- Differences between Hive Internal and External Tables
- Predicting Breast Cancer Using Apache Spark Machine Learning Logistic Regression
- MapReduce processes explained but also explained here
- Apache YARN (Yet Another Resource Negotiator): Hadoop’s cluster resource management system
- Wide VS Narrow Dependecies: Representation/DAG of what Spark analyzes to do optimizations
- Time Series
- Miscellaneous
- R for Data Science textbook
- Intro to Stats
- MIT Deep Learning
- O'Reilly book Hands-on Machine Learning with Scikit-Learn and TensorFlow
- Pre-trained models: Model Zoo for transfer learning
- Chicago Data
- India Data
- Airline Dataset
- List of Public Data Sources Fit for Machine Learning
- Million Song Dataset
- Classification datasets
- 2013-2016 Cleaned & Parsed 10-K Filings with the SEC
- Stanford Large Network Dataset Collection
- Google Dataset Search
- Fashion MNIST with 60k train and 10k test images
- notMNIST dataset Similar to MNIST but used different fonts
- The CIFAR-10 dataset 10 classes with 6k images per class
- Large Scale Visual Recognition Challenge 2012 and 2014
- GeoDa Data Portal mostly smaller sample datasets for use in learning spatial analysis
- spData R package accompanies Geocomputation with R
- geodaData R package currently in development, but includes all the packages in the GeoDa tutorials
- Tidycensus R package useful for getting US boundaries, variables
- OpenStreetMap can use API or osmdata R package to get data CMAP Data Hub: Chicago Metropolitan Agency for Planning (36 datasets, including community areas)
- NYU Spatial Data Repository lots of data and links to other institutions
- ICPSR data portal data archive for the social sciences
- GeoDa Datasets on Github. In R:
install.packages('remotes') remotes::install_github("spatialanalysis/geodaData") library(geodaData) data("chicago_comm")
- Search on StackExchange here and here