How Much Does a GitHub Star Cost? Detecting Fake Stars with AI

GitHub stars, while a vanity metric, influence project selection and investment decisions, leading to a market where stars are bought and sold; this article examines star pricing, common fraud patterns, and how unsupervised clustering and specialized tools can identify and mitigate fake-star activity.

Liangxu Linux
Liangxu Linux
Liangxu Linux
How Much Does a GitHub Star Cost? Detecting Fake Stars with AI

Star Pricing

Public channels sell GitHub stars, for example the sites GitHub24 and Baddhi Shop. Prices vary widely: about 64 USD can buy 1,000 stars from three‑zero accounts, while obtaining 100 active‑behavior stars may cost around 85 USD.

How to Identify Fake Stars

Fake‑star accounts often share obvious traits: they are created on the same day, have fewer than one follower, fewer than one following, own fewer than four public repositories, and many profile fields (email, bio) are empty. GitHub periodically removes such accounts.

Beyond GitHub’s own cleanup, third‑party tools can help spot fake accounts, such as:

astronomer
fake-star-detector

Detecting High‑End Fraud with Clustering

High‑price star purchases are harder to detect because the accounts mimic normal developer behavior. The open‑source orchestration platform dagster demonstrates a detection approach using unsupervised clustering. Each GitHub account is represented as a high‑dimensional vector based on features like:

Code commits

Pull‑request activity

Stars given to projects

Profile edits

Accounts that cluster together in this feature space are likely to belong to the same class. By training on known fake accounts, the model flags nearby accounts as suspicious.

In a case study, a repository deliberately purchased many stars was analyzed. The clustering plot showed:

Blue points: all users

Red points: confirmed fake accounts

Yellow points: accounts suspected by the clustering model

Since every star in that repository was known to be bought, the yellow points were also fake, confirming the model’s effectiveness.

Conversely, for a genuine project such as dagster, the star‑giving behavior of real users does not intersect with the patterns of fake accounts, making false positives unlikely.

Real‑World Impact

Although stars are a vanity metric, many decisions—technical selection, investment, recruitment—rely on them, creating demand for artificial inflation. For example, the open‑source cryptocurrency okcash shows 578 stars, but clustering analysis reveals that 97 % of the accounts that starred it are likely fake, which could undermine confidence in the project.

Practitioners can apply the same clustering pipeline (see the provided GitHub links for the simple and complex models) to audit other open‑source libraries for star fraud.

References

Baddhi Shop: https://baddhi.shop/shop/

zadahmed/music_recommender stargazers: https://github.com/zadahmed/music_recommender/stargazers

astronomer: https://github.com/Ullaakut/astronomer

fake-star-detector code: https://github.com/dagster-io/fake-star-detector/blob/main/fake_star_detector/assets/simpler_model.py

dagster blog on fake stars: https://dagster.io/blog/fake-stars#lets-go-star-shopping

dagster repository: https://github.com/dagster-io/dagster

okcash repository: https://github.com/okcashpro/okcash

Complex model for detection: https://github.com/dagster-io/fake-star-detector/blob/main/fake_star_detector/assets/complex_model.py

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

open-sourceGitHubsoftware metricsunsupervised clusteringstarsfake detection
Liangxu Linux
Written by

Liangxu Linux

Liangxu, a self‑taught IT professional now working as a Linux development engineer at a Fortune 500 multinational, shares extensive Linux knowledge—fundamentals, applications, tools, plus Git, databases, Raspberry Pi, etc. (Reply “Linux” to receive essential resources.)

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.