This is a post which I will update periodically with some of my favorite Quora (& Stackexchange) answers.
- Gamma & Poisson Distributions
- Why is Spark faster than MapReduce?
- Importance of independence in statistical modeling
- Importance of underlying data distributions
- Biggest lessons learned in corporate
- Free open datasets
- Most common data science mistakes
- SVM and Kernels
- SVD and PCA
- Data Science Tools
- Judging a Good Data Scientist (Histograms)
- Math important for ML (by Andrew Ng)
- How to get a job
- The logistic loss in XGBoost
- Hardcore negotiations
From Clay Ford’s blog:
HackerNews:
- Blue collar programmers
- Resources on organization
- Being a data scientist at small-medium companies
- Taking metric game-ification seriously
Singular Value Decomposition:
Other: