Extract repositories data from GitHub and deliver via Webhooks automatically
Navigate to the GitHub Repositories scraper in Scrapernode. Select "Fresh Scrape" for real-time data or "Quick Lookup" for pre-collected records. Each record costs 2 credits.
Add your endpoint URL in the Scrapernode webhook settings. Each completed job sends a POST request with the full JSON payload, including all output fields and metadata.
Paste the GitHub URLs you want to scrape — one per line, or upload a CSV. Scrapernode accepts direct profile links, search result URLs, and content pages.
Click "Start Extraction" to begin. Scrapernode handles proxy rotation, rate limiting, and anti-bot detection automatically. Jobs typically complete in under 60 seconds per batch.
When the job completes, Scrapernode delivers the full JSON payload to your endpoint. Each record includes 20 structured fields like url, id, code_language, code. Parse the payload in your application and process the data as needed.
Cost per record
2 credits
Output fields
20 fields
Destination
Webhooks
Preview the data you'll receive — 5 sample records
20 fields returned per record
Repository web address (100.00% fill rate)
Unique repository ID (100.00% fill rate)
Main programming language used in the repository (79.55% fill rate)
Repository source code files (86.22% fill rate)
Sub-fields
file_nameTextName of the source code filefile_pathTextPath to the file in the repositoryfile_contentTextContent of the source code fileTotal lines of code in the repository (100.00% fill rate)
Repository owner's username (100.00% fill rate)
Owner's GitHub profile URL (100.00% fill rate)
Repository size with units (100.00% fill rate)
Repository size measurement units (KB, MB, GB) (100.00% fill rate)
Repository size as a numeric value (100.00% fill rate)
Repository navigation path and hierarchy (100.00% fill rate)
Sub-fields
nameTextBreadcrumb navigation element nameurlTextURL of the breadcrumb navigation elementTotal count of issues in the repository (100.00% fill rate)
Total count of pull requests (100.00% fill rate)
Number of associated GitHub projects (100.00% fill rate)
Number of times the repository has been forked (100.00% fill rate)
Number of stars the repository has received (100.00% fill rate)
Description of the latest feature or change (99.98% fill rate)
Date of the most recent repository update (99.99% fill rate)
Repository website URL from the About section (72.75% fill rate)
Repository license information (99.83% fill rate)
Sub-fields
nameTextLicense nameurlTextURL to the license detailsCommon questions about How to Scrape GitHub Repositories into Webhooks
Start extracting github repositories data and deliver via Webhooks in minutes.