gitDigger: Creating useful wordlists from public GitHub repositories

Presented at DEF CON 21 (2013), Aug. 4, 2013, 10 a.m. (20 minutes)

This presentation covers the thought process and logistics behind building a better wordlist using public GitHub repositories as its source. With an estimated 2,000,000 GitHub projects to date, how would one store that amount of data? Would you even want or need to? After downloading approximately 500,000 repositories and storing 6 TB on multiple USB drives, this will be a story of one computer, bandwidth, basic Python, and how a small idea quickly got out of hand.
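
As a rough illustration of the idea in the abstract, and not the presenters' actual tooling, a wordlist could be built by walking a local directory of already-downloaded repositories and counting the tokens found in their files. The function names, the token pattern, and the choice to skip `.git` directories are all assumptions made for this sketch.

```python
import os
import re
from collections import Counter

# Assumed token shape: 3 to 32 letters, digits, or underscores.
TOKEN_RE = re.compile(r"[A-Za-z0-9_]{3,32}")


def count_tokens(root):
    """Count tokens in every readable file under root (hypothetical helper)."""
    counts = Counter()
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune git metadata so only working-tree files are scanned.
        dirnames[:] = [d for d in dirnames if d != ".git"]
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                with open(path, "r", encoding="utf-8", errors="ignore") as fh:
                    for line in fh:
                        counts.update(TOKEN_RE.findall(line))
            except OSError:
                # Unreadable files are skipped rather than aborting the run.
                continue
    return counts


def write_wordlist(counts, out_path, min_count=1):
    """Write tokens, most frequent first, one per line (hypothetical helper)."""
    with open(out_path, "w", encoding="utf-8") as out:
        for word, n in counts.most_common():
            if n >= min_count:
                out.write(word + "\n")
```

Ordering by frequency puts the most commonly seen strings at the top of the list, which is one plausible reading of "a better wordlist"; the talk itself may have used different criteria.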


Presenters:

  • Rob Fuller / mubix as Rob Fuller (Mubix)
    Rob Fuller (Mubix) is a Senior Red Teamer. His professional experience starts from his time on active duty as a United States Marine. He has worked with devices and software that run the gamut in the security realm. He has a few certifications that haven't expired yet, but the titles that he holds above the rest are father, husband, and United States Marine.
  • Jaime Filson / WiK as Jaime Filson (WiK)
    Jaime Filson (WiK): Well, WiK's just zis guy. He enjoys long walks on the beach while his computer equipment is busy fuzzing software, cracking passwords, or spidering the internet.
