How to accelerate your programming development


I replicated a Google-style search engine from scratch in Java during my senior year of college. Over the semester, I spent 30 hours on each of five individual assignments, plus more than 100 hours in the last two weeks of classes working in a group of four on the final project.

How to build neural networks that generalize well

Whether we are building a classification or prediction model, our goal is for the model to perform well on data it has not seen before. This is where the model generates value. Performing well on data that is already labeled, or that the model has already seen, does little to drive value.

Best practices from large tech companies that are aggressively experimenting with everything

All of the most successful tech companies constantly experiment with and improve what they offer users. A/B tests are a widely used technique for measuring the impact of changing a feature.

“Randomized A/B or A/B/N tests are considered the gold standard in many quantitative scientific fields for evaluating treatment effects.” —Anirban Deb et al., Uber

While they play an important part in many data-driven decisions, A/B tests are hard to run successfully and harder to run at scale.

Data discovery approaches at the largest tech companies

Many large technology companies rely on data to make important decisions about how best to serve their customers. As these companies grow, their data landscapes grow more complex and become increasingly difficult to navigate.

How 3 of the largest tech companies approach data quality

All high-performing businesses should be leveraging data to make decisions. So much data is available to us, and so many models and techniques let us use it to make informed decisions that add value for our customers and our businesses.

How a popular algorithm we use every day was published as a story about an ancient Greek parliament, and how it works

Paxos is a family of protocols that aims to solve the consensus problem for a network with unreliable nodes. Its original paper held the record for the longest publication delay (over 8 years) because it was presented as a story about a parliament on the ancient Greek island of Paxos.

“I decided to cast the algorithm in terms of a parliament on an ancient Greek island…I gave the Greek legislators the names of computer scientists working in the field, transliterated with Guibas’s help into a bogus Greek dialect…Writing about a lost civilization allowed me to eliminate uninteresting…

This is how I successfully got the most out of my college experience.


I am currently a senior in my last semester at the University of Pennsylvania. I will be graduating with concentrations in Statistics and Finance from Wharton and a Master of Science in Engineering in Data Science from the School of Engineering and Applied Science. I will complete all of this in four years and, after I graduate, start my dream job as a Technical Analyst at Wayfair.

Hands-on Tutorials

How to use two different simulation techniques for hypothesis testing with examples in Python

Hypothesis tests are an important component of statistics. They help us make important decisions in healthcare, business, ethics, and many other fields. There are many great resources explaining how to conduct these tests and how to use distributions like the normal distribution and t-distributions to get results for these tests.

How to leverage the Fresh Start Effect to achieve your goals

The Fresh Start Effect

Everyone has set a New Year’s resolution at some point in their lives. Some of us keep these resolutions. Many more of us do not.

A History of Cloud Computing. Where it started and what the future looks like

The first “cloud computing” project, an MIT research project called Project MAC, was founded in 1963 with funding from ARPA (now DARPA). The goal of this project was to allow multiple users to use programs on a single computer from multiple locations. After just 6 months, over 200 users were able to connect to the system from 10 different locations.

What is Cloud Computing

Before we get…

Kevin Babitz

Data Scientist | MSE in Data Science at University of Pennsylvania (May 2021)
