Carlos Petricioli

October 28, 2014

Data Science for Social Good

The Eric & Wendy Schmidt Data Science for Social Good fellowship is a University of Chicago summer program for aspiring data scientists to work on data mining, machine learning, big data, and data science projects with social impact.

Learn more...

Carlos Petricioli

October 07, 2014

Stanford Network Analysis Project, SNAP

Stanford Network Analysis Platform (SNAP) is a general purpose network analysis and graph mining library. SNAP was originally developed by Jure Leskovec, assistant professor of the Computer Science at Stanford University, in the course of his PhD studies. It includes SNAP for C++, Snap.py, a Stanford Large Network Dataset Collection and a tutorial. It's great for Large Scale Network Analytics.

Learn More...

Carlos Petricioli

August 24, 2014

DSSG - World bank project Blog

The World Bank Group lends billions of dollars each year to fund development projects in its efforts to reduce global poverty. My team and I, are helping investigators at the Bank search for patterns of collusion, corruption, and fraud in its contracts data, using models of contract-specific risk. Developing an automated approach to detecting these offenses can help the World Bank efficiently target future investigations.

Continue Reading...

Carlos Petricioli

May 22, 2014

Data Science for Business

Data Science for Business is a new book by Foster Provost and Tom Fawcett intended for those who need to understand data science/data mining, and those who want to develop their skill at data-analytic thinking. Data Science for Business is not a book about algorithms. Instead it presents a set of fundamental principles for extracting useful knowledge from data.

Learn More...

Carlos Petricioli

April 05, 2014

Modeling Science: Dynamic Topic Models of Scholarly Research

In this video, professor David M. Blei gives an example of how to use Topic models. In his words, "Topic models that are a suite of algorithms that uncover the hidden thematic structure in document collections. These algorithms help us develop new ways to search, browse and summarize large archives of texts."

Carlos Petricioli

March 28, 2014

SUPPLE: Automatically Generating Personalized User Interfaces

The project was developed by Krzysztof Gajos (Harvard University), Daniel Weld (University of Washington), and Jacob Wobbrock (University of Washington). Supple uses decision-theoretic optimization to automatically generate user interfaces adapted to a person's abilities, devices, preferences, and tasks. In particular, SUPPLE can generate user interfaces for people with motor and vision impairments.

Learn more...

Carlos Petricioli

June 12, 2014

Clean Development: Data Mining for Corruption Risks

Every year, the World Bank Group lends billions of dollars to fund infrastructure and other development projects around the globe. Projects vary widely in scale and scope, ranging from developing hydropower systems to rehabilitating coral reefs to improving roads, health, education and agriculture systems. Contractors on these projects are typically hired through a competitive bidding process, in which prospective suppliers of labor, raw materials, or equipment submit proposals to complete the necessary work. Occasionally, prospective contractors attempt to game the competitive system in some way by colluding with other contractors, bribing government officials, or otherwise manipulating the system. Collusion has far-reaching effects on the price and quality of contract delivery, and is difficult to detect.

Continue Reading...

Carlos Petricioli

January 01, 2014

Install Postgres on a Mac OSX

This blog is a great wat to install Postgres on a Mac using Homebrew.

Carlos Petricioli

December, 2013

Bayesian or Frequentist, Which Are You?

This is a great lecture from Professor Michael I. Jordan, professor at the Department of EECS, UC. Berkeley, to understand the differences between Bayessian and Frequentist statistical approaches.


Bayesian or Frequentist, Which Are You?

Michael I. Jordan
2 videos

Download Slides