Interactive Subreddit Map with t-SNE
For part of my presentation at Montreal Python, I made an interactive map of the various sub-sections of the website Reddit (called subreddits). You can take a look at the interactive version or see a...
Data Science and (Unsupervised) Machine Learning with scikit-learn
I gave a talk at Montreal Python this week, and here are the materials and video. As part of this presentation I made a neat little visualization of the structure of the website Reddit. Category:...
Object Lifecycle Management with S3 Lifecycle Editor
As Sunil recently pointed out, one needs to be careful with their usage of the cloud. One of the easy traps to fall into is related to the Simple Storage Service (S3), where you can basically write...
Most Popular Content of 2014 - Data Science, Visualizations & RTB Pacing
2014 was a great year for Datacratic. In addition to creating great technology and products, we hosted global meetups, held informative virtual events and created content to keep everyone up to date on...
Real-Time bidding optimization: a behind-the-scenes look at Datacratic's...
Recently I presented Real-Time bidding optimization: a behind-the-scenes look at Datacratic's predictive API at PAPI's. PAPI's.io is the premier International Conference on Predictive APIs and Apps...
How to Apply Social Ranking for Display Campaign Optimization
Today’s marketer is no stranger to SEO and social media. Yet if you ask how to optimize this data in the domain of programmatic display, you’ll probably hear crickets. And browsing the internet for...
The Programmatic Waterfall Mystery: Solved
A recent article on AdExchanger asks “In the supposedly super-efficient world of RTB, why would publishers continue to waterfall their demand sources?”. The article goes on to say that the publisher’s...
2015 E-Commerce Technology Adoption Report
Our partner, AdGear, is excited to share the infographic version of the Top 1300 E-Commerce Operators - Marketing Technology Adoption Report for 2015. Category: Technology. Keywords: retailer technology...
Epic NHL goal celebration hack with a hue light show and real time machine...
Now that the Stanley Cup playoffs have started and every other person you cross on the sidewalk is wearing a Montreal Canadiens jersey, I turned my attention to solving a serious machine learning...
Parsers printing rule: make sure you print what you parsed
Over the last few years, I've had various encounters with different parser behaviours. I wrote a blog entry on what parsers should do to avoid being frustrating. Category: Technology
StarCluster: Multiple node instance type support
Last week I released a new feature for the vanilla_improvements branch of StarCluster: multiple instance type support. It means that our cluster can now select the instance type to bid on depending on...
Readings of the Month: The Effective Engineer and Big Data
Over the past month I read The Effective Engineer and Big Data. Follow the links to read their respective reviews. Category: Technology
1M QPS with nginx and Ubuntu 12.04 on EC2
We have been running quite a few tests lately to understand the maximum number of HTTP queries per second (QPS) that a modern server running Ubuntu 12.04 with a recent Linux kernel could...
Drag'n'Drop Pivot Tables and Charts, in Jupyter/IPython Notebook
PivotTable.js is a Javascript Pivot Table and Pivot Chart library with drag’n’drop interactivity, and it’s now available integrated into Jupyter/IPython Notebook. It’s been available to RStudio users...
Mapping Press Releases in the 2015 Canadian Federal Election
The 2015 Canadian federal election is in its final stretch and Datacratic's data science team thought it would be a great opportunity to collect some data and do some machine learning. Citizen data...
Needles and haystacks: finding the one bad request among billions with tcpdump
This week, we had a few weird crashes with an HTTP server that we could not easily reproduce, and we had a hard time pinpointing the source of the issue. We knew the problems were triggered by bad...
Machine Learning Meets Economics
The business world is full of streams of items that need to be filtered or evaluated: parts on an assembly line, résumés in an application pile, emails in a delivery queue, transactions awaiting...
Applied Auction Theory in Online Advertising
I was recently invited to give a talk about auction theory and online advertising at Concordia University for a course entitled Social and Information Networks, which uses a really interesting textbook...