Bonum Certa Men Certa

Maintaining the 'Delete Github' page

By figosdev

GitHub RepositoriesSummary: "This list really is a starting point, which can hopefully increase awareness about the issue of concern."

THE primary goal of the Delete Github page is an early-warning system for projects that Microsoft has undue control over -- they can restrict access, delete projects, or (perhaps in a worst-case scenario) either deliberately modify a project download as SourceForge once did, or simply act a vector for malicious activity -- as has already happened to Gentoo and Canonical.



"At this stage there is an effort to be as comprehensive as possible -- to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly."As with higher quality warning systems, including all anti-malware software, false positives are a known possibility. Some care is taken to avoid them, simply to increase the quality of the list itself. At this stage there is an effort to be as comprehensive as possible -- to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly.

Admittedly, there is a lower threshold of evidence to get on the list than to be removed. While people may decide to make decisions based on this list, we hope they would double-check our findings first. This is research, it is not law -- the list includes IceWM, which I use routinely, and JWM, which I have running as a process as we speak. It includes Leafpad, which I am using to type this very line of text. If we find evidence that WordPress is using Github (and there is a repo for it, I haven't checked it yet and it isn't on the list yet) it is unlikely that Roy will decide to remove it from the Techrights servers.

"To some of us, this list is like an endangered species list -- we are concerned about the future of projects that are hosted on Github."If a clerical error places an allergy you don't have on your medical records, it is very possible that when this error is found, a single line will be drawn through the note. That is the sort of response that will most likely come of discovering errors in the list. A prominent note (perhaps in bold text) is the most likely correction.

To some of us, this list is like an endangered species list -- we are concerned about the future of projects that are hosted on Github. And we are most eager to update the status of each one (even possibly Systemd, not necessarily for any wishes of success.)

But how can we know that we are in error? Certainly there is some level of assurance we can be given, most likely we will try to re-evaluate items that are called into question as our priorities allow. But we can check on the first one during the writing of this article:

"We will also probably use this lowercase convention for projects that abandon Github for other repos -- if it's all lowercase, that means we no longer believe the project is relying on Github.""OpenBSD does not use Github for development at all Roy. They do not even use git!"

"I'm sure there are others in the list that also do not use github."

Hopefully so. And by no means is Wikipedia a facts-only source of information, but if we simply grep https://en.wikipedia.org/wiki/OpenBSD:

$ leafpad <(wget -O- https://en.wikipedia.org/wiki/OpenBSD | tr ">" "\n" | grep ithub)

We get this:

<a rel="nofollow" class="external text" href="https://github.com/openbsd"

"When this list started, it was a completely manual process."Now, it's possible that OpenBSD doesn't use this repo. If so, the article should be corrected. Let's increase the quality of this check and visit the actual article: https://en.wikipedia.org/wiki/OpenBSD

Ah, now -- one of the not-yet-implemented (though already considered) quality checks is to verify if the Github link is in the summary box (or whatever it's called, I'm not a Wikipedian) on the top right. Selecting it and right-clicking, "View Selection Source" we find this is probably called an "infobox" and that it begins with:

<table class="infobox vevent"

It completes with "</tbody></table>" which we can use in the future to parse infoboxes.

Doing a further search for our Github link, we find the non-url text of our link is in the "External links" section, titled "Github mirror".

"There are still checks in place, but in the writing of this article we have found things we can do to refine the process."What have we learned:

1. The Github link isn't in the infobox section, which decreases the significance of the link.

2. The official repo is https://cvsweb.openbsd.org/cgi-bin/cvsweb/ this is actually the most pertinent information.

3. The text of the very link that put OpenBSD on our list was labeled "Github mirror" which isn't necessarily criteria we want or need to add projects to this list.

"Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did."I would say this warrants a correction, which I will issue now before the article is finished. Since the first letter of every list entry is capitalised, we will make it easier to parse / automatically filter corrected entries by making the title all lowercase: OpenBSD -> openbsd. And... updated. Thanks to arm for setting this straight.

We will also probably use this lowercase convention for projects that abandon Github for other repos -- if it's all lowercase, that means we no longer believe the project is relying on Github.

When this list started, it was a completely manual process. Since then, (only) some aspects have drifted towards rudimentary automation. This has produced a more complete list, with a slightly higher tendency towards false positives. There are still checks in place, but in the writing of this article we have found things we can do to refine the process.

"For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away."Even after items are added, we can use this new information to double-check the entries (even all at once.) That doesn't mean this will happen today, but it may ultimately make it easier to discover when projects abandon Github, which would be a bonus. Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did.

For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away. This is the stage where the list is built as large as possible, within reason and utility.

At no point has the process become entirely automated, though I did just parse all of English Wikipedia for Github entries. Wikipedia will not be (is not) our only source, entries do not automatically go from discovery to inclusion (they are checked further, and also chosen manually for relevance, but please feel very free to suggest items you think are worth adding) but we will now try to slightly refine the process of adding new entries, first.

"We also want to encourage projects to leave Github."We want everybody to realise that Github holds more mainstream projects than many of us thought. We also want to encourage projects to leave Github. We already know that GNOME and KDE mostly have their own repositories, though at the moment it appears that Kate for example, may use Github for something (that is still being looked into.) It's surprising to discover gnuradio on Github -- we are also aware that some of these (like OpenBSD) are just mirrors.

This list really is a starting point, which can hopefully increase awareness about the issue of concern. As awareness increases, we fully expect the quality of this information to be further refined -- not unlike the software projects listed here as talented people review them and contribute to the sources.

Thanks again, you've helped make this list better than it was when we started.

Licence: Creative Commons CC0 1.0 (public domain)

Recent Techrights' Posts

Saving What's Left of Decent and Independent Journalism on the Web
We increasingly (over time) try to make local copies (hosted on our server) of important documents; it's hard to rely on third parties
Brian Fagioli's Latest "Linux" Article Appears to be Fake
Another form of plagiarism/ripoff using bots?
[Meme] When the People Who Falsely Accuse You of Pedophilia Turn Out to be Projecting
When you attack something or someone using falsehoods, as happens a lot to Richard Stallman (RMS), there's risk that the attacks will backfire, badly
Why I Continue to Believe That at the End Software Freedom Will Win
a short and incomplete list of factors which I believe contribute to the sentiment that we can - and will - win the battles over hearts and minds in the "Tech" realm
 
Links 18/12/2024: Doha/Qatar Trafficking, Bloat Comfort Zone, and Advent of Code 2024
Links for the day
[Meme] Microsoft's Latest Marketing Pitch
"Stop Being Poor; buy a new PC with TPMs"
In South Africa, a Very Large Nation, Web Developers Can Already Ignore Microsoft Browsers (Edge Measured Below 3% in 55 Nations)
The dumb assumption you must naively test with Microsoft browsers is no longer applicable in a lot of places
Open Source Initiative (OSI) is the Voice of Bill Gates and Satya Nadella
Not hard to see what they've done with the money
Microsoft Boasts That Its (Microsoft-Sponsored) "Open Source AI" Propaganda Got Cited in Media (That's Just What the Money Did)
This is a grotesque openwashing campaign
In Many Places Around the World, Perhaps as Expected, Yandex is Nearly Bigger Than Microsoft (Like in Several African Countries)
Microsoft may soon fall to "third place" in search
Keeping Productive This Christmas
We've (pre)paid for hosting till almost January 2026 and fully back on the saddle
IBM and Canonical Leave Money on the Table Because Microsoft Pays Them Not to Compete and Instead Market Windows, WSL, Microsoft 'Clown Computing', and TPMs
Where are the regulators?
Other Editors Who Agree "Hey Hi" (AI) is Just Hype But Won't Say So Publicly as It Might Upset Key Sponsors
Some media would gladly participate in a scam to make money
IBM (and Red Hat) is a Patent Troll, Still Leveraging Software Patents to Extract Money Out of Other Companies by Suing Them
Basically, when it comes to patents, IBM is demonstrably part of the problem, not the solution
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Tuesday, December 17, 2024
IRC logs for Tuesday, December 17, 2024
In Some Countries, Such as Greece, Almost 80% of Windows Users Are on Vista 10 and About 85% Need to Move to GNU/Linux for Security Patches
Vista 11 was a failure
[Meme] They Don't Want the Public to Know What "Responsible Encryption" Really Means
They also blame "China" for their own back doors (because China learned how to exploit those)
The Linux Foundation's Certificate Authority (CA) Significantly and Suspiciously Raises the Number of Certificates It Issues (Quantity Increase/Inflation) by Lessening Their Lifetime in the Name of 'Security' (That Barely Makes Sense!)
LE made 3 months the "standard" for most, soon to become just 6 days instead of 6 months?
Links 17/12/2024: More China Sanctions, GOP Scheming to Prop Up Fentanylware (TikTok)
Links for the day
Gemini Links 17/12/2024: The Streisand Effect and Productivity-systems Desiderata
Links for the day
Technology: rights or responsibilities? - Part X
By Dr. Andy Farnell
Links 17/12/2024: More "Tesla Autopilot" and "Hey Hi" (AI) Blunders
Links for the day
Instead of Promoting GNU/Linux (or Ubuntu) Ahead of Vista 10's EoL Canonical is Marketing Microsoft's Proprietary Software
It's like Canonical employs people who work for Microsoft, not for Canonical
Links 17/12/2024: Many Abuses by Microsoft and War Updates From Ukraine
Links for the day
Content Management Systems (CMS) Bloat/ Static Site Generators (SSG) Trouble
some Web site management stories
DEI Room at fedoraproject.org Pretty Much Dead
We're not against diversity but against its weaponisation by greedy people who do not value diversity at all
The "Latest Technology News" at BetaNews is Slop About Slop
This is at the very top of the "news" (front page) at the moment
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Monday, December 16, 2024
IRC logs for Monday, December 16, 2024
Gemini Links 16/12/2024: Invisibles and 20 Years of GNU/Linux on the Desktop
Links for the day
Microsoft's Windows Fell From 98% to Less Than 15% (in 15 Years in Africa)
Operating System Market Share Africa
Swaziland: GNU/Linux Leaps to 7.24%, Based on statCounter
Remember that Microsoft had many layoffs this year in Africa
A Birthday Wish
My birthday is a few hours away
[Meme] Definitely Not Your Role Models
Hypocrite Neckbeard Meme
Changes or Variation of Logo at the FSF as 40th Anniversary is Near (Months Away)
Next year the FSF turns 40
Mobile Usage Nearly 90% in Maharlika (Philippines)?
Microsoft has become just a footnote
Push Back and Become More Vocal for LLM Abuse and Misuse to Stop
We hope that more people out there (sites too) will call out the people who saturate particular topics on the Web with machine-generated junk
The Media Failed to Hold GAFAM Accountable (and Now It Suffers From It and For It)
This recognition of the problem emboldens us to carry on
Botswana: New Highs for GNU/Linux, All-Time Lows for Microsoft
No wonder Microsoft has so many layoffs in Africa this year
Links 16/12/2024: Skinnerboxes ("Smart" "Phones") and Control Social Media Blamed for Fights
Links for the day
Reminder: The Microsoft Person Who Used OpenAI for En Masse GPL Violations Told the Whistleblower to Kill Herself
The evidence (real message)
Links 16/12/2024: emacs, Drawabox, “You Should Have Your Own Website”
Links for the day
In Some Parts of the World, Like Central America and South America, Microsoft is Irrelevant on the Web
Nadella has bet the farm on a Ponzi scheme
[Meme] Microsoft is Not a Country
Reporting crimes is essential for democracy
There's Not Much Time Left for President Biden to Pardon Julian Assange and Signal to Journalists That Exposing States' Crimes or Rich People's Misbehaviour is Lawful
Apathy towards this is part of the problem
Image Fusion is Not 'AI' (LLMs Aren't Either)
Such fakes can (and always could) be done by a digital artist, it's just a little more expensive and time-consuming
GNU/Linux at New Highs in Bosnia And Herzegovina
Quite a few Balkan nations show high adoption rates for GNU/Linux
From Scientists to Pigeons: The EPO Has Turned Patent Examination Into a Process Made by Computers and Improperly Trained Staff Which Doesn't Meet the Requirements of the European Patent Convention (EPC)
Might as well abolish this entire system if this is the current trajectory
Razik Menidjel Will No Longer be Chief Operating Officer Operations at the EPO
What does the EPC say about slop and should it be updated to deal with trouble such as slop?
Underpaid and Inexperienced Workers Overwhelm the EPO, Granting Many Invalid Patents and Placing Pressure on Veteran Examiners
So-called "production" (giving monopolies) pressure is "compromising the quality of our products" [sic] according to a new report
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Sunday, December 15, 2024
IRC logs for Sunday, December 15, 2024