Final Thesis: Crawling Code Review Data From Phabricator

Abstract: Modern code review is typically supported by software tools. Researchers use data tracked by these tools to study code review practices. A popular tool in open-source and closed-source projects is Phabricator. However, there is no tool to crawl all the available code review data from Phabricator hosts. In this thesis, we develop a Python crawler named Phabry, for crawling code review data from Phabricator instances using its REST API. The tool produces minimal server and client load, reproducible crawling runs, and stores complete and genuine review data. The new tool is used to crawl the Phabricator instances of the open source projects FreeBSD, KDE and LLVM. The resulting data sets can be used by researchers.

PDFs: Master Thesis, Work Description

Reference: Dumitru Cotet. Crawling Code Review Data From Phabricator. Master Thesis, Friedrich-Alexander-Universität Erlangen-Nürnberg: 2019.