Jonathan Dufault

Jonathan Dufault

it’s usually a data problem

Full-stack data engineer and scientist, turning questions into projects. Personal work circles around NLP, food science, Arduino, infrastructure, and cartoon cats. Code is maybe 40% of figuring something out. The rest is just staring at it until it makes sense.

Recent Projects

  • When the Model Isn’t the Answer
    If you stare at any two datasets long enough, you can convince yourself there’s a connection between them. Not because there is, but because there is an important enough question that the data “should” be connected. It’s a dangerous place from which to start a modeling project. This is one… Read more: When the Model Isn’t the Answer