# Data Science Vocabulary List
## Unit 1
Box plots
Classification problem
Confidence intervals
Correlation
Dataframe
Histograms
Line graph
Logistic Model
Mean
Median
Mode
NumPy
P-values
Pandas
Quartile
Range
Reliable
Reproducible
Scatter matrix
Scatter plots
Secondary data
Standard deviation
Variance
Visualization
## Unit 2
AUC - Area under the curve
Bias
Categorical variables
Classification
Confusion Matrix
Continuous variables
Cross Validation
Generalization Error
Global Minimum
Gradient Descent
K-Fold
KNN - K-Nearest Neighbors
Lasso Regression
Linear regression
Link Function
Local Minimum
Logistic regression
Loss functions
Mean absolute error
Mean squared error
Minkowski Metric
Model Fit
Odds / Odds Ratios
Ordinary Least Squares
Overfitting
R-Squared
Regularization
Residual
Ridge Regression
ROC - Receiver operating characteristic
Root mean squared error
Sampling
Sigmoid Function
Variance
## Unit 3
Aperiodic cycles
ARIMA model - Autoregressive integrated moving average
Autocorrelation
Collinearity
Decay
Decision trees
Denormalized
Detrending
Differencing
Dimensionality reduction
Document-based databases
Lag
LDA - Latent Dirichlet Allocation
Mean scaling
Moving Average
Multicollinearity
Normalized
NLP - Natural language processing
NLTK - Natural Language Toolkit
Non-stationary
Periodicity
Random forests
Refactor
Relational databases
Residuals
Rolling means
Stationarity
Time-series
Tokenize
Topic models
Unit Testing
Weighted Moving Average