Marco Santoni


Data Scientist

My Talk about Superset [Python Milano Meetup]

Yesterday, I gave a talk Python Milano Meetup. The Meetup was designed as Python pills: three 20-minutes talks in a row. The talks: ...

Manufacturing. When data is not a commodity

What does it mean to work as a data scientist in manufacturing? What is the value behind data? Data science has gained popularity in ...

Weighted Random Sampling with PostgreSQL [Follow-up]

I received valuable feedbacks by Jim Nasby regarding the post about weighted random sampling with PostgreSQL. I will report here Jim's ...

Monitoring Bus Frequencies in Rome

I have just launched atacmonitor. It is a website providing information about the waiting time at bus stops in Rome. Overview The ...

Blog Migrated to Pelican on GitHub Pages

I have migrated my blog. It is built under Pelican, a static site generator. It allows me to write posts as plain markdown or even ...

Insights from IEEE Big Data 16

I have attended the IEEE Big Data 16 conference in Washington DC. I thank my company for sponsoring the trip. The conference included a ...

Weighted Random Sampling with PostgreSQL

You have a table like the following: CREATE TABLE weights ( color varchar primary key, weight float ); INSERT INTO weights (color, ...

Parallel Computing in Python with concurrent.futures

Going Parallel with concurrent.futures¶Parallel computation can be implemented via the concurrent.futures module. The module is part of ...

Applied Bayesian Inference with PyMC [video]

I was glad to give an intro to Bayesian Inference at PyData Florence 2016. The video of the talk is now out.

A Simple Machine Learning Pipeline

This post contains the code that I used in my talk at Python Milano Meetup on June 22nd 2016. The talk was a quick overview of Pipeline, ...


Page 1 / 2