Coursework

Spring 2024

Scope: Pandas [Series, DataFrame; slicing, grouping, aggregating]; data cleaning and EDA [granularity, scope, and temporality]; RegEx; visualization [Matplotlib, Seaborn], kernel density estimate; sampling [convenience/probability/random sample, chance error; selection/response/non-response bias]; modeling [bias-variance tradeoff, complexity], loss functions [MAE, MSE], OLS; feature engineering [one-hot encoding, polynomial features]; sklearn, gradient descent [learning rate, dimensionality, batch size], PyTorch [Stochastic GD]; cross-validation, regularization; random variables [expectation, variance], distribution [Bernoulli, Binomial, Uniform, Gaussian]; SQL; logistic regression, cross-entropy loss [decision rule, threshold]; linear separability, confusion matrix [accuracy, precision, recall], ROC curve [AUC]; PCA [singular value decomposition, data variance and centering]; clustering [K-Means; hierarchical with single/average/complete link], inertia, silhouette score.

Course notes can be accessed here.

Scope: list, array, SLL, doubly linked list, inheritance, casting, HOF, comparators; B-tree, red-black tree; hashing; priority queue, stack, min-max heap; graph, tree traversals [DFS, BFS]; Dijkstra's algorithm for shortest path, Kruskal's and Prim's algorithm for MST, A * search; DAG; refix operations, tries; selesction sort, heapsort, mergesort, insertion sort, quicksort; radix sort; asymptotics, time and space complexity, compression.

Largely based on projects on Java. Worked in IntelliJ.

Scope: proofs [induction], propositional logic; stable matching; modular arithmetic, Chinese Remainder Theorem; RSA, Fermat's Little Theorem; polynomials, Lagrange interpolation, error-correcting code; graphs, Eulerian, Hamiltonian tour/walk; counting, countability; random variables [Uniform, Bernoulli, Binomial, Gaussian, Poisson, Geometric]; expectation, variance, PDF, CDF; sampling, confidence interval; continuous, joint probability; continous probability, Markov's, Chebyshev's inequality; Markov chains.

Scope: matrix operations, determinants; vector spaces, inner product spaces; eigenvalues and eigenvectors; linear transformations, symmetric matrices; linear ordinary differential equations (ODE), systems of linear ODE; Fourier series.

Fall 2023

Scope: visualization [histograms]; sampling, distributions [normal, skewed], Chebyshev's Inequality, Central Limit Theorem; inference, causality, confounding factors; hypothesis testing [null/alternative, test statistic, p-value cutoff], A/B testing; bootstrapping, confidence intervals; linear regression, Least Squares; Nearest Neighbors.

Worked with datascience library based on Pandas.
The course textbook can be accessed here.

Scope: data abstraction using interfaces, objects, classes, and generic operators; recursion, higher-order functions, generators, streams, currying; interpreters, macros.

Wrote a Scheme interpreter based on Python as part of the course.

Scope:
• Statistical methods: Introduction to probabilities, weighted averages, correlations, data visualization, cross-tabulation, t-hypothesis tests, ANOVA tests, confidence intervals, A/B causality tests, chi-squared tests, and linear regression with non-linear variables, discontinuity, and seasonality.
• Business functions: Explore sales, market research, fundraising, customer service, corporate training, process improvement, online marketing, travel operations, procurement, financial planning & analysis, staffing, and other business functions.

Worked with Python and SQL. Largely based on labs with case studies.