Articles data-engineering

Data pipeline with Docker, InfluxDB, and Grafana

Set up a typical data pipeline, of the kind used in experimental science and IoT, with Docker, InfluxDB, and Grafana.

MongoDB: Remote Access (Raspberry Pi)

With this simple recipe, you will learn how to manage ports on your Raspberry Pi (or any Debian-based system) and how to enable remote access to your database server. Installing MongoDB on the Pi is also covered.

MongoDB in 10 Minutes: Fill with Python, Read with pandas

Set up your first MongoDB server, store data with Python, and analyze it with pandas in a Jupyter notebook.

Text Preprocessing for Machine Learning (Yelp Dataset, Part 2)

The Yelp dataset is large and stored as text. This article gives detailed explanations and all the code needed to convert it to a NumPy array for machine learning.

Simple Text Mining with Pandas

Prepare the Yelp dataset (shop reviews) for natural language processing.
