GitHub Repository: YStrano/DataScience_GA
Path: blob/master/resources/student-resources/ds-vocab.md

Data Science Vocabulary List

## Unit 1

  • Box plots

  • Classification problem

  • Confidence intervals

  • Correlation

  • Dataframe

  • Histograms

  • Line graph

  • Logistic Model

  • Mean

  • Median

  • Mode

  • NumPy

  • P-values

  • Pandas

  • Quartile

  • Range

  • Reliable

  • Reproducible

  • Scatter matrix

  • Scatter plots

  • Secondary data

  • Standard deviation

  • Variance

  • Visualization
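
Several of the Unit 1 terms (mean, median, mode, range, variance, standard deviation, quartile) can be illustrated with a short sketch. The data values below are made up for illustration, and the sketch uses only Python's standard `statistics` module rather than NumPy or Pandas; it assumes population (not sample) variance.

```python
import statistics

# Hypothetical sample data, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(data)              # arithmetic average
median = statistics.median(data)          # middle value of the sorted data
mode = statistics.mode(data)              # most frequent value
value_range = max(data) - min(data)       # spread between the extremes
variance = statistics.pvariance(data)     # population variance
std_dev = statistics.pstdev(data)         # population standard deviation
quartiles = statistics.quantiles(data, n=4)  # three cut points: Q1, Q2, Q3

print(mean, median, mode, value_range, variance, std_dev)
print(quartiles)
```

Note that `statistics.variance` and `statistics.stdev` compute the sample versions (dividing by n - 1), which give slightly larger values on the same data.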

## Unit 2

  • AUC - Area under the curve

  • Bias

  • Categorical variables

  • Classification

  • Confusion Matrix

  • Continuous variables

  • Cross Validation

  • Generalization Error

  • Global Minimum

  • Gradient Descent

  • K-Fold

  • KNN - K-Nearest Neighbors

  • Lasso Regression

  • Linear regression

  • Link Function

  • Local Minimum

  • Logistic regression

  • Loss functions

  • Mean absolute error

  • Mean squared error

  • Minkowski Metric

  • Model Fit

  • Odds / Odds Ratios

  • Ordinary Least Squares

  • Overfitting

  • R-Squared

  • Regularization

  • Residual

  • Ridge Regression

  • ROC - Receiver operating characteristic

  • Root mean squared error

  • Sampling

  • Sigmoid Function

  • Variance
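
A few of the Unit 2 terms (residual, mean absolute error, mean squared error, root mean squared error, R-squared, sigmoid function) are easiest to see side by side. The sketch below is a minimal illustration in plain Python; the `y_true` and `y_pred` values are invented, and the helper name `sigmoid` is an assumption, not taken from any course material.

```python
import math

# Hypothetical observed values and model predictions.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.0, 5.0, 8.0, 11.0]
n = len(y_true)

# Residual: observed value minus predicted value.
residuals = [t - p for t, p in zip(y_true, y_pred)]

mae = sum(abs(r) for r in residuals) / n   # mean absolute error
mse = sum(r ** 2 for r in residuals) / n   # mean squared error
rmse = math.sqrt(mse)                      # root mean squared error

# R-squared: 1 - (residual sum of squares / total sum of squares).
y_mean = sum(y_true) / n
ss_res = sum(r ** 2 for r in residuals)
ss_tot = sum((t - y_mean) ** 2 for t in y_true)
r_squared = 1 - ss_res / ss_tot


def sigmoid(z):
    """Map any real number into the interval (0, 1)."""
    return 1 / (1 + math.exp(-z))


print(mae, mse, rmse, r_squared, sigmoid(0))
```

MSE penalizes large residuals more heavily than MAE because each residual is squared, and RMSE brings the result back to the units of the target variable.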

## Unit 3

  • Aperiodic cycles

  • ARIMA model - Autoregressive integrated moving average

  • Autocorrelation

  • Collinearity

  • Decay

  • Decision trees

  • Denormalized

  • Detrending

  • Differencing

  • Dimensionality reduction

  • Document-based databases

  • Lag

  • LDA - Latent Dirichlet Allocation

  • Mean scaling

  • Moving Average

  • Multicollinearity

  • Normalized

  • NLP - Natural language processing

  • NLTK - Natural Language Toolkit

  • Non-stationary

  • Periodicity

  • Random forests

  • Refactor

  • Relational databases

  • Residuals

  • Rolling means

  • Stationarity

  • Time-series

  • Tokenize

  • Topic models

  • Unit Testing

  • Weighted Moving Average
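
The time-series terms in Unit 3 (lag, differencing, rolling means, weighted moving average) can be sketched in plain Python. The series values and the helper names `difference`, `rolling_mean`, and `weighted_moving_average` are hypothetical, chosen only for illustration; in practice a library such as Pandas would typically be used.

```python
# Hypothetical time series with an accelerating upward trend.
series = [10, 12, 15, 19, 24]


def difference(values, lag=1):
    """Subtract the value `lag` steps earlier from each value."""
    return [values[i] - values[i - lag] for i in range(lag, len(values))]


def rolling_mean(values, window):
    """Unweighted average over each consecutive window of values."""
    return [
        sum(values[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(values))
    ]


def weighted_moving_average(values, weights):
    """Average over each window, with the last weight on the newest value."""
    window = len(weights)
    total = sum(weights)
    return [
        sum(w * v for w, v in zip(weights, values[i - window + 1 : i + 1])) / total
        for i in range(window - 1, len(values))
    ]


first_diff = difference(series)       # [2, 3, 4, 5]
second_diff = difference(first_diff)  # [1, 1, 1]
smoothed = rolling_mean(series, window=3)
weighted = weighted_moving_average(series, weights=[1, 2, 3])

print(first_diff, second_diff)
print(smoothed, weighted)
```

Here the first difference still trends upward, while the second difference is constant, which is the sense in which differencing can remove a trend and move a non-stationary series toward stationarity.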