Mathew Kiang (.com) | Yes, with one "t".

Using a histogram as a legend in choropleths

January 16, 2017

Despite well known drawbacks,¹ plotting parameters onto maps provides a convenient way of seeing context, patterns, and outliers. However, one of the many problems with choropleths is that the area of the regions tend to distort our perception of the value of the region. For example, in the United States, huge (in terms of land mass) counties will tend to have a greater visual impact than small counties (despite often having similar or even smaller population sizes).

One way to address this is to use a histogram as a legend on your map. The histogram then provides you with a way of showing raw counts of equal weights while the map allows you to provide the spatial context of the values.

Show 1 footnote

On graduate student burnout: “It isn’t usually a snap so much as a gradual disintegration.”

January 7, 2017

PSA: Applications are open for the 2017 Data Science for Social Good Summer Fellowship. 10/10 would do again.

December 9, 2016

Use bash to concatenate files in R

November 9, 2016

Often, I find I need to loop through directories full of csv files, sometimes tens of thousands of them, in order to combine them into a single analytical dataset I can use. When it’s only a few dozen, using fread(), read_csv, or the like can be fine, but nothing is quite as fast as using awk or cat.

Here’s a snippet of code that allows one to use bash in R to concatenate csv files in a directory. People in the lab have found it helpful so maybe others will as well.

A visual tour of my publications

October 8, 2016

I recently came across this paper by Michal Brzezinski about (the lack of) power laws in citation distributions. It made me a little curious about the citations of my own articles so I threw together a little script using James Keirstead’s Scholar package for R. In the plot above, every line represents a single article with time on the x-axis and (cumulative) number of citations on the y-axis.

It’s not super informative, so we can break it down a few ways to graphically explore the data.

Our unsurprising result: Among other things, mutual respect is important for implementing large-scale healthcare initiatives.

September 24, 2016

And associated Reddit Science AMA on our last essay…

December 21, 2015

Sadly relevant: Our essay on police killings and police deaths.

December 21, 2015

Reporter posts his mobile phone metadata for the public to analyze

August 17, 2015

The colon operator really is the fastest.

June 9, 2015

Way back when I was first learning R, I ran across an old listserv post that talked about how the colon (:) operator was the fastest way to generate a sequence. I never really thought about it, but I got in the habit of always using it whenever I needed a sequence.

Our “deaths due to ‘legal interventions'” essay. It’s not peer-reviewed (and it’s a couple months old), but increasingly relevant.

April 30, 2015

tl;dr: How you implement things — even simple things like checklists — is important.

April 6, 2015

New MetroCard rates and the dreaded “dead zone of change”

March 23, 2015

Looking at abortion laws and infant deaths

March 12, 2015

Boston’s brutal winter

March 11, 2015

Waterfalls of Eligible Singles

March 8, 2015

As a Valentine’s Day (gag) gift to one of my friends, I created a Shiny app¹ that will calculate the number of people in the United States who meet specified sex, age, marital status, race/ethnicity, educational attainment, employment status, and annual income requirements.

Show 1 footnote

tl;dr version: Discrimination is bad for everybody (especially those discriminated against).

June 25, 2014

Our new paper’s punchline: Health inequities need not rise as population health improves.

March 25, 2014

Shiny + deSolve = Interactive ODE Models

December 20, 2013

While taking a disease dynamics course, I thought it would be a good opportunity to learn how to use the Shiny package in R and create an interactive interface for some of my problem sets. After a few trial runs with smaller, simpler setups, I have wrapped up the side project (for now). You can see it in action here ¹ and you can view the final code on my Git.

Show 1 footnote