Andrey Kamenov, Ph.D. Probability and Statistics
Andrey Kamenov is a data scientist working for Advameg Inc. His background includes teaching statistics, stochastic processes and financial mathematics in Moscow State University and working for a hedge fund. His academic interests range from statistical data analysis to optimal stopping theory. Andrey also enjoys his hobbies of photography, reading and powerlifting.
The deadliest shooting in U.S. history happened last year in Las Vegas. But there is much more to gun-related incidents. The Gun Violence Archive database is one of the most complete sources of data. By now, it has records of over 30,000 gun-related incidents that happened this year — and that’s excluding accidental shootings.
Is there any pattern? Or are these incidents completely random?
Here’s a plot showing the total number of incidents, day-by-day, from the start of the year.
It’s relatively well-known by now that the population of the United States is becoming more diverse each year. A few weeks ago, the U.S. Census Bureau released its 2016 population estimates. Let’s see what the most recent data says about the population’s racial and ethnic backgrounds.
The most dangerous jobs are usually the ones we associate with manual labor. Heavy and civil engineering construction is an industry sector that usually comes to mind when thinking about occupational injuries. Well, it’s certainly risky at around 20 fatal injuries per 100,000 employees per year; however, it’s far from having the most dangerous jobs in the country.
Males are older than their partners in two-thirds of all unmarried straight couples. In 22 percent of all cases, the difference is greater than five years. The opposite case, where the female is at least five years older, takes place in just 10 percent of all relationships.
Here’s the detailed chart showing the complete distribution.
The percentage of America’s Hispanic population has been steadily increasing in the recent years. According to the U.S. Census Bureau’s 2016 population estimates, it’s at 17.8 percent now, up from 16.4 percent six years ago.
Here’s an interactive map showing the percentage of Hispanic population by state and county. Note that you can click on any state to zoom in.
Do you live in one of the largest cities? Chances are that you don’t: the 10 largest cities make up only 8 percent of the country’s population. This figure has remained stable in the recent years; in 2000, the number was slightly higher at 8.5 percent.
Where are the most innovative companies in the U.S. located? For most, the answer seems quite obvious: Silicon Valley.
The San Jose-Sunnyvale-Santa Clara Metropolitan Statistical Area (MSA) has held the top spot since at least 2000. Several companies made their contributions — AMD, HP, Cisco and Apple, to name a few. In 2015, Google was issued the largest number of patents in Silicon Valley. We took a deeper look at the innovation here in our earlier post.
We saw in a recent post that Prosper’s own rating system provides relatively accurate results. However, there is some confusion as to whether it takes returning borrowers into account. As we will show in this post, it is possible to improve upon the standard rating system, thanks to the fact that the number of previous Prosper loans is readily available.