Extract repository data from GitHub at scale
Navigate to the GitHub Repositories scraper and select "Fresh Scrape" for real-time data or "Quick Lookup" for pre-collected records. Each record costs 2 credits.
Paste the GitHub URLs you want to scrape — one per line, or upload a CSV. Scrapernode accepts direct profile links, search result URLs, and hashtag pages.
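Before pasting or uploading, it can help to clean the list. The following is a minimal sketch, not part of Scrapernode: the helper names `prepare_url_list` and `write_url_file` and the file name `github_urls.csv` are illustrative assumptions. It keeps only github.com links, removes exact duplicates, and writes one URL per line. Whether an uploaded CSV needs a header row is not stated above, so the sketch writes none.

```python
from urllib.parse import urlparse


def prepare_url_list(raw_urls):
    """Return de-duplicated github.com URLs in their original order."""
    seen = set()
    cleaned = []
    for raw in raw_urls:
        url = raw.strip().rstrip("/")
        if not url:
            continue
        parsed = urlparse(url)
        # Keep only http(s) links whose host is github.com.
        if parsed.scheme not in ("http", "https"):
            continue
        if parsed.netloc.lower() not in ("github.com", "www.github.com"):
            continue
        if url in seen:
            continue
        seen.add(url)
        cleaned.append(url)
    return cleaned


def write_url_file(urls, path="github_urls.csv"):
    """Write one URL per line (no header row; that is an assumption)."""
    with open(path, "w", encoding="utf-8", newline="\n") as handle:
        for url in urls:
            handle.write(url + "\n")
```

Calling `write_url_file(prepare_url_list(my_links))` produces a file that can be uploaded, or whose contents can be pasted directly.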
Click "Start Extraction" to begin. Scrapernode handles proxy rotation, rate limiting, and anti-bot detection automatically. Jobs typically complete in under 60 seconds per batch.
Once complete, download your results as JSON or CSV. Each record includes 20 structured fields like url, id, code_language, code, and more.
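A downloaded JSON export can be inspected with a few lines of Python. This sketch assumes the file is a top-level JSON array of record objects, which the page does not confirm, and the helper names are illustrative. It uses the `code_language` field named above, which is not filled for every record, so missing or empty values are grouped under "unknown".

```python
import json
from collections import Counter


def load_records(path):
    """Load a JSON export, assumed to be a top-level array of record objects."""
    with open(path, encoding="utf-8") as handle:
        records = json.load(handle)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of records")
    return records


def count_by_language(records):
    """Count records per code_language; missing or empty values become 'unknown'."""
    return Counter(record.get("code_language") or "unknown" for record in records)
```

For example, `count_by_language(load_records("results.json")).most_common(5)` lists the five most frequent languages in the export, where `results.json` is a placeholder for your downloaded file.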
Set up webhooks to receive data automatically when jobs complete, or use the REST API for programmatic scraping. Integrate with n8n, Make, or Zapier for workflow automation.
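On the receiving side, a webhook is an HTTP endpoint that accepts a POST request. The sketch below uses only Python's standard library and rests on several assumptions: that the webhook body is JSON, that any 2xx response counts as an acknowledgement, and that port 8000 is reachable from the sender. None of this is documented above, so check the actual payload format before relying on it.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def parse_webhook_body(body: bytes):
    """Decode a webhook body as JSON (the JSON payload is an assumption)."""
    return json.loads(body.decode("utf-8"))


class WebhookHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length)
        try:
            payload = parse_webhook_body(body)
        except ValueError:
            # Covers both invalid UTF-8 and invalid JSON.
            self.send_response(400)
            self.end_headers()
            return
        # Hypothetical handling: report how many top-level items arrived.
        size = len(payload) if isinstance(payload, (list, dict)) else 1
        print(f"received webhook payload with {size} top-level item(s)")
        self.send_response(204)
        self.end_headers()


def serve(port: int = 8000):
    """Run the receiver until interrupted; the port is an arbitrary choice."""
    HTTPServer(("", port), WebhookHandler).serve_forever()
```

Calling `serve()` starts the receiver. In practice the handler would store or forward the payload instead of printing a summary.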
Cost per record: 2 credits
Output fields: 20 fields
Output formats: JSON, CSV
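With a flat per-record price, budgeting a job is simple arithmetic. This sketch assumes every returned record is billed at the stated 2 credits and that nothing else is charged; the function name is illustrative.

```python
CREDITS_PER_RECORD = 2  # the per-record price stated on this page


def estimate_credits(record_count: int) -> int:
    """Estimate the credits needed for a job, assuming flat per-record billing."""
    if record_count < 0:
        raise ValueError("record_count must not be negative")
    return record_count * CREDITS_PER_RECORD
```

Under that assumption, a job returning 1,500 records would use 1,500 × 2 = 3,000 credits.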
Preview the data you'll receive — 5 sample records
20 fields returned per record
Repository web address (100.00% fill rate)
Unique repository ID (100.00% fill rate)
Main programming language used in the repository (79.55% fill rate)
Repository source code files (86.22% fill rate)
Sub-fields:
  file_name (Text): Name of the source code file
  file_path (Text): Path to the file in the repository
  file_content (Text): Content of the source code file
Total lines of code in the repository (100.00% fill rate)
Repository owner's username (100.00% fill rate)
Owner's GitHub profile URL (100.00% fill rate)
Repository size with units (100.00% fill rate)
Repository size measurement units (KB, MB, GB) (100.00% fill rate)
Repository size as a numeric value (100.00% fill rate)
Repository navigation path and hierarchy (100.00% fill rate)
Sub-fields:
  name (Text): Breadcrumb navigation element name
  url (Text): URL of the breadcrumb navigation element
Total count of issues in the repository (100.00% fill rate)
Total count of pull requests (100.00% fill rate)
Number of associated GitHub projects (100.00% fill rate)
Number of times the repository has been forked (100.00% fill rate)
Number of stars the repository has received (100.00% fill rate)
Description of the latest feature or change (99.98% fill rate)
Date of the most recent repository update (99.99% fill rate)
Repository website URL from the About section (72.75% fill rate)
Repository license information (99.83% fill rate)
Sub-fields:
  name (Text): License name
  url (Text): URL to the license details
Common questions about How to Scrape GitHub Repositories
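The nested `code` field in the list above can be unpacked with a short helper. This is a sketch under assumptions: it treats `code` as a list of objects carrying the `file_name`, `file_path`, and `file_content` sub-fields, and it treats a missing or empty `code` value as "no files", since that field is not filled for every record. The function name is illustrative.

```python
def summarize_code_files(record):
    """Return (path, line_count) pairs for the files in one record's `code` field."""
    files = record.get("code") or []  # absent or empty for some records
    summary = []
    for entry in files:
        content = entry.get("file_content") or ""
        # Fall back to the file name when no path is present.
        path = entry.get("file_path") or entry.get("file_name") or "<unnamed>"
        summary.append((path, len(content.splitlines())))
    return summary
```

The same pattern of `record.get(...) or default` applies to any other field with a fill rate below 100%.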
Start extracting GitHub repository data in minutes. No code required: just paste your URLs and go.