Bonum Certa Men Certa

Maintaining the 'Delete Github' page

By figosdev

GitHub RepositoriesSummary: "This list really is a starting point, which can hopefully increase awareness about the issue of concern."

THE primary goal of the Delete Github page is an early-warning system for projects that Microsoft has undue control over -- they can restrict access, delete projects, or (perhaps in a worst-case scenario) either deliberately modify a project download as SourceForge once did, or simply act a vector for malicious activity -- as has already happened to Gentoo and Canonical.



"At this stage there is an effort to be as comprehensive as possible -- to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly."As with higher quality warning systems, including all anti-malware software, false positives are a known possibility. Some care is taken to avoid them, simply to increase the quality of the list itself. At this stage there is an effort to be as comprehensive as possible -- to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly.

Admittedly, there is a lower threshold of evidence to get on the list than to be removed. While people may decide to make decisions based on this list, we hope they would double-check our findings first. This is research, it is not law -- the list includes IceWM, which I use routinely, and JWM, which I have running as a process as we speak. It includes Leafpad, which I am using to type this very line of text. If we find evidence that WordPress is using Github (and there is a repo for it, I haven't checked it yet and it isn't on the list yet) it is unlikely that Roy will decide to remove it from the Techrights servers.

"To some of us, this list is like an endangered species list -- we are concerned about the future of projects that are hosted on Github."If a clerical error places an allergy you don't have on your medical records, it is very possible that when this error is found, a single line will be drawn through the note. That is the sort of response that will most likely come of discovering errors in the list. A prominent note (perhaps in bold text) is the most likely correction.

To some of us, this list is like an endangered species list -- we are concerned about the future of projects that are hosted on Github. And we are most eager to update the status of each one (even possibly Systemd, not necessarily for any wishes of success.)

But how can we know that we are in error? Certainly there is some level of assurance we can be given, most likely we will try to re-evaluate items that are called into question as our priorities allow. But we can check on the first one during the writing of this article:

"We will also probably use this lowercase convention for projects that abandon Github for other repos -- if it's all lowercase, that means we no longer believe the project is relying on Github.""OpenBSD does not use Github for development at all Roy. They do not even use git!"

"I'm sure there are others in the list that also do not use github."

Hopefully so. And by no means is Wikipedia a facts-only source of information, but if we simply grep https://en.wikipedia.org/wiki/OpenBSD:

$ leafpad <(wget -O- https://en.wikipedia.org/wiki/OpenBSD | tr ">" "\n" | grep ithub)

We get this:

<a rel="nofollow" class="external text" href="https://github.com/openbsd"

"When this list started, it was a completely manual process."Now, it's possible that OpenBSD doesn't use this repo. If so, the article should be corrected. Let's increase the quality of this check and visit the actual article: https://en.wikipedia.org/wiki/OpenBSD

Ah, now -- one of the not-yet-implemented (though already considered) quality checks is to verify if the Github link is in the summary box (or whatever it's called, I'm not a Wikipedian) on the top right. Selecting it and right-clicking, "View Selection Source" we find this is probably called an "infobox" and that it begins with:

<table class="infobox vevent"

It completes with "</tbody></table>" which we can use in the future to parse infoboxes.

Doing a further search for our Github link, we find the non-url text of our link is in the "External links" section, titled "Github mirror".

"There are still checks in place, but in the writing of this article we have found things we can do to refine the process."What have we learned:

1. The Github link isn't in the infobox section, which decreases the significance of the link.

2. The official repo is https://cvsweb.openbsd.org/cgi-bin/cvsweb/ this is actually the most pertinent information.

3. The text of the very link that put OpenBSD on our list was labeled "Github mirror" which isn't necessarily criteria we want or need to add projects to this list.

"Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did."I would say this warrants a correction, which I will issue now before the article is finished. Since the first letter of every list entry is capitalised, we will make it easier to parse / automatically filter corrected entries by making the title all lowercase: OpenBSD -> openbsd. And... updated. Thanks to arm for setting this straight.

We will also probably use this lowercase convention for projects that abandon Github for other repos -- if it's all lowercase, that means we no longer believe the project is relying on Github.

When this list started, it was a completely manual process. Since then, (only) some aspects have drifted towards rudimentary automation. This has produced a more complete list, with a slightly higher tendency towards false positives. There are still checks in place, but in the writing of this article we have found things we can do to refine the process.

"For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away."Even after items are added, we can use this new information to double-check the entries (even all at once.) That doesn't mean this will happen today, but it may ultimately make it easier to discover when projects abandon Github, which would be a bonus. Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did.

For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away. This is the stage where the list is built as large as possible, within reason and utility.

At no point has the process become entirely automated, though I did just parse all of English Wikipedia for Github entries. Wikipedia will not be (is not) our only source, entries do not automatically go from discovery to inclusion (they are checked further, and also chosen manually for relevance, but please feel very free to suggest items you think are worth adding) but we will now try to slightly refine the process of adding new entries, first.

"We also want to encourage projects to leave Github."We want everybody to realise that Github holds more mainstream projects than many of us thought. We also want to encourage projects to leave Github. We already know that GNOME and KDE mostly have their own repositories, though at the moment it appears that Kate for example, may use Github for something (that is still being looked into.) It's surprising to discover gnuradio on Github -- we are also aware that some of these (like OpenBSD) are just mirrors.

This list really is a starting point, which can hopefully increase awareness about the issue of concern. As awareness increases, we fully expect the quality of this information to be further refined -- not unlike the software projects listed here as talented people review them and contribute to the sources.

Thanks again, you've helped make this list better than it was when we started.

Licence: Creative Commons CC0 1.0 (public domain)

Recent Techrights' Posts

A Week After a Worldwide Windows Outage Microsoft is 'Bricking' Windows All On Its Own, Cannot Blame Others Anymore
A look back at a week of lousy press coverage, Microsoft deceit, and lessons to be learned
 
Links 26/07/2024: Hamburgerization of Sushi and GNU/Linux Primer
Links for the day
Links 26/07/2024: Tesco Cutbacks and Fake Patent Courts
Links for the day
Links 26/07/2024: Grimy Residue of the 'AI' Bubble and Tensions Around Alaska
Links for the day
Gemini Links 26/07/2024: More Computers and Tilde Hosting
Links for the day
Links 26/07/2024: "AI" Hype Debunked and Elon Musk's "X" Already Spreads Political Disinformation
Links for the day
"Why you boss is insatiably horny for firing you and replacing you with software."
Ask McDonalds how this "AI" nonsense with IBM worked out for them
No Olympics
We really need to focus on real news
Nobody Holds the GNOME Foundation Accountable (Not Even IRS), It's Governed by Lawyers, Not Geeks, and Headed by a Shaman Crank
GNOME is a deeply oppressive institutions that eats its own
[Meme] The 'Modern' Web and 'Linux' Foundation Reinforcing Monopolies and Cementing centralisation
They don't care about the users and issuing a few bytes with random characters costs them next to nothing. It gives them control over billions of human beings.
'Boiling the Frog' or How Online Certificate Status Protocol (OCSP) is Being Abandoned at Short Notice by Let's Encrypt
This isn't a lack of foresight but planned obsolescence
When the LLM Bubble Implodes Completely Microsoft Will be 'Finished'
Excuses like, "it's not ready yet" or "we'll fix it" won't pass muster
"An escalator can never break: it can only become stairs"
The lesson of this story is, if you do evil things, bad things will come your way. So don't do evil things.
When Wikileaks Was Still Primarily a Wiki
less than 14 years ago the international media based its war journalism on what Wikileaks had published
The Free Software Foundation Speaks Out Against Microsoft
the problem is bigger than Microsoft and in the long run - seeing Microsoft's demise - we'll need to emphasise Software Freedom
IRC Proceedings: Thursday, July 25, 2024
IRC logs for Thursday, July 25, 2024
Over at Tux Machines...
GNU/Linux news for the past day
Links 26/07/2024: E-mail on OpenBSD and Emacs Fun
Links for the day
Links 25/07/2024: Talks of Increased Pension Age and Biden Explains Dropping Out
Links for the day
Links 25/07/2024: Paul Watson, Kernel Bug, and Taskwarrior
Links for the day
[Meme] Microsoft's "Dinobabies" Not Amused
a slur that comes from Microsoft's friends at IBM
Flashback: Microsoft Enslaves Black People (Modern Slavery) for Profit, or Even for Losses (Still Sinking in Debt Due to LLMs' Failure)
"Paid Kenyan Workers Less Than $2 Per Hour"
From Lion to Lamb: Microsoft Fell From 100% to 13% in Somalia (Lowest Since 2017)
If even one media outlet told you in 2010 that Microsoft would fall from 100% (of Web requests) to about 1 in 8 Web requests, you'd probably struggle to believe it
Microsoft Windows Became Rare in Antarctica
Antarctica's Web stats still near 0% for Windows
Links 25/07/2024: YouTube's Financial Problem (Even After Mass Layoffs), Journalists Bemoan Bogus YouTube Takedown Demands
Links for the day
Gemini Now 70 Capsules Short of 4,000 and Let's Encrypt Sinks Below 100 (Capsules) as Self-Signed Leaps to 91%
The "gopher with encryption" protocol is getting more widely used and more independent from GAFAM
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, July 24, 2024
IRC logs for Wednesday, July 24, 2024
Techrights Statement on YouTube
YouTube is a dying platform
[Video] Julian Assange on the Right to Know
Publishing facts is spun as "espionage" by the US government and "treason" by the Russian government, to give two notable examples
Links 25/07/2024: Tesla's 45% Profit Drop, Humble Games Employees All Laid Off
Links for the day
Gemini Links 25/07/2024: Losing Grip and collapseOS
Links for the day