scrapernode
  • All Platforms
  • Use Cases
  • Categories
  • Data Points
  • How-To Guides
  • Integrations
  • Compare
Platforms
  • LinkedInLinkedIn5
  • Google MapsGoogle Maps2
  • InstagramInstagram4
  • TikTokTikTok8
  • XTwitter/X2
  • YouTubeYouTube3
  • FacebookFacebook10
  • Jobs
  • Billing
  • Docs
  • Settings

© 2026 Scrapernode

scrapernode
PlatformsUse CasesHow-ToJobsBilling
ScrapeLinkedInLinkedInprofiles by search term
Scrape LinkedIn profiles by search term. Extract Google Maps business listings and reviews. Collect Facebook pages, groups and posts. Scrape Instagram profiles, reels and comments. Extract TikTok posts and creator profiles. Collect YouTube channels and video data. Scrape X / Twitter profiles and posts. Extract Indeed job listings and salaries. Collect Yelp business reviews and ratings.
Home/Integrations/How to Scrape GitHub Repositories into Google Sheets
GitHubGoogle Sheets

How to Scrape GitHub Repositories into Google Sheets

Extract repositories data from GitHub and sync to Google Sheets automatically

Step-by-step guide

1

Choose the GitHub Repositories scraper

Navigate to the GitHub Repositories scraper in Scrapernode. Select "Fresh Scrape" for real-time data or "Quick Lookup" for pre-collected records. Each record costs 2 credits.

2

Set up your Google Sheets connection

Connect your Google account and select the target spreadsheet. Scrapernode will auto-create columns matching the scraper's output fields and append new rows as data arrives.

3

Provide your GitHub input URLs

Paste the GitHub URLs you want to scrape — one per line, or upload a CSV. Scrapernode accepts direct profile links, search result URLs, and content pages.

4

Launch the scraping job

Click "Start Extraction" to begin. Scrapernode handles proxy rotation, rate limiting, and anti-bot detection automatically. Jobs typically complete in under 60 seconds per batch.

5

Receive data in Google Sheets

Once the job completes, results are automatically synced to your spreadsheet. Each row contains one record with 20 structured fields like url, id, code_language, code, and more. New jobs append additional rows automatically.

Cost per record

2 credits

Output fields

20 fields

Destination

Google Sheets

Sample Output

Preview the data you'll receive — 5 sample records

Record 1 of 5
Url
sample_url
Id
ACoAABcXYZ123
Code Language
sample_code_language
Code
sample_code
Num Lines
1,000
User Name
James K.
User Url
sample_user_url
Size
51-200
Size Unit
sample_size_unit
Size Num
1,000
Breadcrumbs
sample_breadcrumbs
Num Issues
1,000
Num Pull Requests
1,000
Num Projects
1,000
Num Fork
1,000
Num Stared
1,000
Last Feature
sample_last_feature
Latest Update
sample_latest_update
Website Url
sample_website_url
License
sample_license

Data Dictionary

20 fields returned per record

Repository web address (100.00% fill rate)

Unique repository ID (100.00% fill rate)

Main programming language used in the repository (79.55% fill rate)

Repository source code files (86.22% fill rate)

Sub-fields

file_nameTextName of the source code file
file_pathTextPath to the file in the repository
file_contentTextContent of the source code file

Total lines of code in the repository (100.00% fill rate)

Repository owner's username (100.00% fill rate)

Owner's GitHub profile URL (100.00% fill rate)

Repository size with units (100.00% fill rate)

Repository size measurement units (KB, MB, GB) (100.00% fill rate)

Repository size as a numeric value (100.00% fill rate)

Repository navigation path and hierarchy (100.00% fill rate)

Sub-fields

nameTextBreadcrumb navigation element name
urlTextURL of the breadcrumb navigation element

Total count of issues in the repository (100.00% fill rate)

Total count of pull requests (100.00% fill rate)

Number of associated GitHub projects (100.00% fill rate)

Number of times the repository has been forked (100.00% fill rate)

Number of stars the repository has received (100.00% fill rate)

Description of the latest feature or change (99.98% fill rate)

Date of the most recent repository update (99.99% fill rate)

Repository website URL from the About section (72.75% fill rate)

Repository license information (99.83% fill rate)

Sub-fields

nameTextLicense name
urlTextURL to the license details

Frequently Asked Questions

Common questions about How to Scrape GitHub Repositories into Google Sheets

Ready to connect GitHub data to Google Sheets?

Start extracting github repositories data and sync to Google Sheets in minutes.

Go to GitHub Repositories scraperBrowse all integration guides
No code requiredAuto-delivery to Google Sheets20 data fields