Browse by Date
Browse by Date
A chronological journey through 149 posts
2025
November 2025
2025
2025
2024
December 2024
2024
November 2024
2024
2024
2024
2024
April 2024
Basic DSPy RAG tutorial on DataGrapple blog posts
Basic DSPy RAG tutorial on DataGrapple blog posts
Read post2024
February 2024
2023
July 2023
2023
2023
2023
April 2023
2023
2023
2022
2022
2022
2022
2022
September 2022
2022
2022
2022
2022
2021
December 2021
2021
2021
November 2021
[paper] Top2Vec: Distributed Representations of Topics
with application on 2020 10-K business descriptions
Read post2021
October 2021
[HKML] Hong Kong Machine Learning Meetup Season 4 Episode 2
[HKML] Hong Kong Machine Learning Meetup Season 4 Episode 2
Read post2021
August 2021
Embeddings of Sectors and Industries using Graph Neural Networks
node2vec embeddings of industries projected onto the 2d plane
Read post2021
April 2021
2021
February 2021
The Swelling Effect: Think twice before averaging covariance matrices
A few ellipsoids representing the associated covariance matrices along the geodesic path from the leftmost to the rightmost matrices.
Read post2021
February 2021
Conditional CorrGAN: An example in Google Colab
A few cCorrGAN-generated correlation matrices, and the confusion matrix of a SPDNet + RBN classification.
Read post2021
January 2021
Classification of Correlation Matrices using SPDNet with Riemannian Batch Normalization
Illustration from "A Riemannian Network for SPD Matrix Learning" https://arxiv.org/pdf/1608.04233.pdf
Read post2021
January 2021
[Paper] Summary of 'Interpretable Machine Learning – A Brief History, State-of-the-Art and Challenges'
[Paper] Summary of Interpretable Machine Learning – A Brief History, State-of-the-Art and Challenges
Read post2021
2020
2020
December 2020
[Paper] Summary of 'Explaining by Removing: A Unified Framework for Model Explanation'
Illustration from Explaining by Removing: A Unified Framework for Model Explanation
Read post2020
November 2020
2020
November 2020
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 4
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 4
Read post2020
October 2020
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 3
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 3
Read post2020
October 2020
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 2
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 2
Read post2020
September 2020
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 1
[HKML] Hong Kong Machine Learning Meetup Season 3 Episode 1
Read post2020
2020
2020
2020
2020
2020
2020
August 2020
Release of a few pretrained CorrGAN models
Release of a few pretrained CorrGAN models
Read post2020
2020
2020
2020
2020
2020
July 2020
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 8
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 8
Read post2020
2020
2020
2020
June 2020
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 7
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 7
Read post2020
2020
May 2020
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 6
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 6
Read post2020
April 2020
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 5
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 5
Read post2020
2020
April 2020
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 4
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 4
Read post2020
March 2020
[Paper + Implementation] The Hierarchical Equal Risk Contribution Portfolio (Part I)
[Paper + Implementation] The Hierarchical Equal Risk Contribution Portfolio (Part I)
Read post2020
2020
February 2020
CorrVAE: A VAE for sampling realistic financial correlation matrices (Tentative II)
CorrVAE: A VAE for sampling realistic financial correlation matrices (Tentative II)
Read post2020
February 2020
CorrVAE: A VAE for sampling realistic financial correlation matrices (Tentative I)
CorrVAE: A VAE for sampling realistic financial correlation matrices (Tentative I)
Read post2020
February 2020
S&P 500 Sharpe vs. Correlation Matrices - Building a dataset for generating stressed/rally/normal scenarios
S&P 500 Sharpe vs. Correlation Matrices - Building a dataset for generating stressed/rally/normal scenarios
Read post2020
January 2020
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 3
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 3
Read post2019
2019
December 2019
Comparison of Network-based and Minimum Variance Portfolios Using CorrGAN
Comparison of Network-based and Minimum Variance Portfolios Using CorrGAN
Read post2019
December 2019
Hierarchical Risk Parity - Implementation & Experiments (Part III)
Hierarchical Risk Parity - Implementation & Experiments (Part III)
Read post2019
October 2019
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 2
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 2
Read post2019
October 2019
TF 2.0 DCGAN for 100x100 financial correlation matrices
TF 2.0 DCGAN for 100x100 financial correlation matrices
Read post2019
September 2019
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 1
[HKML] Hong Kong Machine Learning Meetup Season 2 Episode 1
Read post2019
September 2019
TF 2.0 GAN MLP for 100x100 financial correlation matrices
TF 2.0 GAN MLP for 100x100 financial correlation matrices
Read post2019
September 2019
[HKML] Supercharge your Marketing with Data & ML . [HKML <> IAB] . Off-Season #1
[HKML] Supercharge your Marketing with Data & ML . [HKML <> IAB] . Off-Season #1
Read post2019
September 2019
Permutation invariance in Neural networks
Permutation invariance in Neural networks
Read post2019
August 2019
Using LIME to 'explain' Snorkel Labeler
Using LIME to “explain” Snorkel Labeler
Read post2019
July 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 12 (Season Finale)
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 12 (Season Finale)
Read post2019
July 2019
Stylized Facts of Financial Correlations
Stylized Facts of Financial Correlations
Read post2019
July 2019
CorrGAN: A GAN for sampling correlation matrices (Part II)
CorrGAN: A GAN for sampling correlation matrices (Part II)
Read post2019
June 2019
CorrGAN: A GAN for sampling correlation matrices (Part I)
CorrGAN: A GAN for sampling correlation matrices (Part I)
Read post2019
June 2019
[ICML 2019] Day 5 - Workshop Time Series
[ICML 2019] Day 5 - Workshop Time Series
Read post2019
June 2019
[ICML 2019] Day 4 - Interpretability, Natural Language Processing, Smarter than AI four year old kids, Unsupervised Learning
[ICML 2019] Day 4 - Interpretability, Natural Language Processing, Smarter than AI four year old kids, Unsupervised Learning
Read post2019
June 2019
[ICML 2019] Day 3 - Robotics, Good ol' Sparse Coding, misc. applications, Transfer, Multitask and Active Learning
[ICML 2019] Day 3 - Robotics, Good ol’ Sparse Coding, misc. applications, Transfer, Multitask and Active Learning
Read post2019
June 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 11
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 11
Read post2019
June 2019
[ICML 2019] Day 2 - U.S. Census, Time Series, Hawkes Processes, Shapley values, Topological Data Analysis, Deep Learning & Logic, Random Matrices, Optimal Transport for Graphs
[ICML 2019] Day 2 - U.S. Census, Time Series, Hawkes Processes, Shapley values, Topological Data Analysis, Deep Learning & Logic, Random Matrices, Optimal Transport for Graphs
Read post2019
June 2019
[ICML 2019] Day 1 - Tutorials
[ICML 2019] Day 1 - Tutorials
Read post2019
May 2019
Experimenting with LIME - A tool for model-agnostic explanations of Machine Learning models
Experimenting with LIME - A tool for model-agnostic explanations of Machine Learning models
Read post2019
May 2019
[ICML 2019] Reading list of accepted papers
[ICML 2019] Reading list of accepted papers
Read post2019
May 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 10
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 10
Read post2019
May 2019
May the Fourth: VADER for Credit Sentiment?
May the Fourth: VADER for Credit Sentiment?
Read post2019
May 2019
Snorkel Credit Sentiment - Part 1
First experiment with Snorkel Metal – Credit Sentiment on DataGrapple blogs
Read post2019
April 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 9
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 9
Read post2019
March 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 8
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 8
Read post2019
February 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 7
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 7
Read post2019
January 2019
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 6
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 6
Read post2018
December 2018
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 5
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 5
Read post2018
December 2018
[Paper] A Backtesting Protocol in the Era of Machine Learning
[Paper] A Backtesting Protocol in the Era of Machine Learning
Read post2018
November 2018
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 4
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 4
Read post2018
October 2018
[Book] The UnRules - Man, Machines and the Quest to Master Markets
[Book] The UnRules - Man, Machines and the Quest to Master Markets
Read post2018
October 2018
Hierarchical Risk Parity - Implementation & Experiments (Part II)
Hierarchical Risk Parity - Implementation & Experiments (Part II)
Read post2018
October 2018
Network-based vs. Minimum Variance portfolios: Any deep connections?
Network-based vs. Minimum Variance portfolios: Any deep connections?
Read post2018
October 2018
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 3
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 3
Read post2018
October 2018
How to sample uniformly over the space of correlation matrices? The onion method
How to sample uniformly over the space of correlation matrices? The onion method
Read post2018
October 2018
Hierarchical Risk Parity - Implementation & Experiments (Part I)
Hierarchical Risk Parity - Implementation & Experiments (Part I)
Read post2018
September 2018
[Book] Neural Network Methods for Natural Language Processing
[Book] Neural Network Methods for Natural Language Processing
Read post2018
September 2018
``Combination of Rankings'' - The Stationary Distribution
``Combination of Rankings’’ - The Stationary Distribution
Read post2018
September 2018
``Combination of Rankings'' - The Full Coverage Case
``Combination of Rankings’’ - The Full Coverage Case
Read post2018
September 2018
[Bloomberg Meetup] The Forefront of Technologies in Finance
[Bloomberg Meetup] The Forefront of Technologies in Finance
Read post2018
August 2018
A Monte Carlo study of the ``Combination of Rankings'' methods
A Monte Carlo study of the ``Combination of Rankings’’ methods
Read post2018
August 2018
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 2
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 2
Read post2018
August 2018
Combination of Rankings
Combination of Rankings - A Proper Merging of Experts Views
Read post2018
July 2018
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 1
[HKML] Hong Kong Machine Learning Meetup Season 1 Episode 1
Read post2018
July 2018
[ICML 2018] Retrospectives
[ICML 2018] Retrospectives
Read post2018
July 2018
[ICML 2018] Day 4 - Time-Series Analysis, NLP, and More Human-like Learning Machines
[ICML 2018] Day 4 - Time-Series Analysis, NLP, and More Human-like Learning Machines
Read post2018
July 2018
[ICML 2018] Day 3 - Energy, GANs, Rankings, Curriculum Learning, and our paper
[ICML 2018] Day 3 - Energy, GANs, Rankings, Curriculum Learning, and our paper
Read post2018
July 2018
[ICML 2018] Day 2 - Representation Learning, Networks and Relational Learning
[ICML 2018] Day 2 - Representation Learning, Networks and Relational Learning
Read post2018
July 2018
[ICML 2018] Day 1 - Tutorials
[ICML 2018] Day 1 - Tutorials
Read post2018
June 2018
On the difficulty of reading numbers in different languages
On the difficulty of reading numbers in different languages
Read post2018
June 2018
Neural Style Transfer applied to paintings
In this short blog, we apply the fast style transfer as implemented in tensorflow/magenta.
Read post2018
June 2018
How to compute the Planar Maximally Filtered Graph (PMFG)
How to compute the Planar Maximally Filtered Graph (PMFG)
Read post2018
May 2018
How to detect false strategies? The Deflated Sharpe Ratio
Deflated Sharpe Ratio
Read post2018
May 2018
APIs for getting crypto related data
Following my blog post Download & Play with Cryptocurrencies Historical Data in Python, I got several times questions on how to get the historical data. In the previous blog, I used a Python wr...
Read post2018
March 2018
Physiological analytics for sports
I recently got a Garmin Forerunner 935 (advised by a good friend of mine). After using it for two months, I can say that I’m happy with it so far. It has lots of functionalities and analytics that ...
Read post2018
February 2018
AQR Academic Factors
AQR has released an implementation of the well-known academic factors in its AQR Data Library:
Read post2018
February 2018
Quant Blogs
Here a tentative of a list of interesting blogs to keep up with quant best practices to study financial markets. I hope that my readers will help me curate this list by suggesting new blogs, removi...
Read post2017
December 2017
Tail Dependence Coefficients
Research material:
Read post2017
November 2017
Riemannian Geometry of Correlation Matrices
Research material:
Read post2017
November 2017
PhD defense - Some contributions to the clustering of financial time series
Here are the slides. The PhD studies were generously funded by Hellebore Capital.
Read post2017
November 2017
[Correlation] How to visualize dependence between two variables?
In this blog, we provide a snippet of code to explore the dependence between two variables. We illustrate its use on visualizing the dependence between a few of the main cryptocurrencies: Bitcoin (...
Read post2017
September 2017
[Clustering] How to sort a distance matrix
Following the Ecole Polytechnique - Data Science Summer School where I got several times questions about how I produced the sorted correlation matrices displayed in my poster, I decided to write ...
Read post2017
September 2017
[Field report] Data Science Summer School at Ecole Polytechnique (with Bengio, Russell, Bousquet, Archambeau and others)
A small field report with personal viewpoint about the Data Science Summer School (Ecole Polytechnique) Monday, Aug. 28 – Friday, Sept. 1, 2017.
Read post2017
August 2017
Download & Play with Cryptocurrencies Historical Data in Python
To access the CryptoCompare public API in Python, we can use the following Python wrapper available on GitHub: cryCompare.
Read post2017
August 2017
Reading list of NLP stuff
General NLP:
Read post2017
August 2017
Quick correlation study between BTC/USD and ETH/USD
import numpy as np import scipy import pandas as pd import matplotlib.pyplot as plt %matplotlib inline import seaborn as sns import json from datetime import timedelta
Read post2017
August 2017
Field reports from ICML 2017 in Sydney
My colleague, Mikolaj Binkowski, at Hellebore Capital was at the 34th International Conference on Machine Learning ICML 2017 in Sydney to represent the company and present his work on Deep Learnin...
Read post2017
July 2017
Study of US Stocks Correlations, Hierarchies and Clusters
In this small study, we use hierarchical clustering techniques to explore the structure of correlations between US stocks. To do so, we first download a dataset of adjusted close prices for the US ...
Read post2017
June 2017
Ether vs. Bitcoin -- Part 0 bis
For the last few days, ETH has lost 1/3 of its value with -20% several days in a row. We update the initial study with up-to-date data to take into account this recent drawback. We add a comment on...
Read post2017
June 2017
Ether vs. Bitcoin -- Part 0
In this introduction notebook, we simply displayed the distribution of the returns and see that tails are heavy, meaning that standard quant models cannot be applied to cryptocurrencies either…
Read post2017
June 2017
June Ethereum London Meetup at Imperial College
I attended this evening the June Ethereum London Meetup at Imperial College (I have been there a couple of times before). Imperial College’s big amphitheater in Sir Alexander Fleming Building was f...
Read post2017
May 2017
Swap Data Repositories for Credit Default Swaps
What are Swap Data Repositories?
Read post