Statistics and OpenData enable network effects, data reuse, and collaborative revision of our project. But they are tricky and can’t be released carelessly:
impressions is what our browser extension might collect. Public are the only one we consider, it is a decision take in the browser, based on the visibility configured by the content author. Posts only for friends or with Custom audience are considered private
timelines are the number of newsfeed observed by the browser extension. skipped are all the pages except the newsfeed, which are excluded from being collected.
below a graph on how our parsers are performing: how many HTMLs have been parsed successfully or not