Generate datasets to understand some clustering algorithms behavior

Posted on Sun 11 November 2018 in machine learning • Tagged with clustering, R, machine learning • 7 min read

To understand how a clustering algorithm works, it helps to have sample datasets that highlight its behavior under specific conditions. This post shows how to generate nine datasets that are used in other posts of this series on clustering.


Data classes in Python

Posted on Sat 27 October 2018 in coding • Tagged with Python • 5 min read

This blog post presents the brand-new Python data classes feature and its benefits.


Static typing in Python

Posted on Sat 13 October 2018 in coding • Tagged with Python • 5 min read

This article covers static typing in Python: how and why to annotate Python code with types, how to type-check it statically, and how to enable powerful IDE features.


Encoding in Python

Posted on Sat 29 September 2018 in coding • Tagged with Python • 6 min read

The transition from Python 2 to Python 3 caused some problems because the two versions handle text differently. We first look at how text is represented in Python 2 and Python 3, then at how to convert between the representations, and finally at the places where encoding comes into play: the source code encoding, implicit conversions, the encoding of inputs and outputs, and the file system encoding.


XXVth Meeting of the Société Francophone de Classification

Posted on Sun 16 September 2018 in Meeting • Tagged with Clustering • 4 min read

Last week, I was at the XXVth Meeting of the Société Francophone de Classification, both as a participant and a member of the steering committee.


Key differences between mainly used languages for data science

Posted on Sat 01 September 2018 in Coding • Tagged with Javascript, Python, Scala, C • 6 min read

This blog post introduces the notions of strong and weak typing on the one hand, and static and dynamic typing on the other. It is illustrated with four languages commonly used in data science pipelines.


Purpose of this blog

Posted on Sat 01 September 2018 in misc • 1 min read

This article is an introduction to this blog.

