EditorsAbout the SiteComes vs. MicrosoftUsing This Web SiteSite ArchivesCredibility IndexOOXMLOpenDocumentPatentsNovellNews DigestSite NewsRSS

11.12.19

Maintaining the ‘Delete Github’ page

Posted in Microsoft, Site News at 11:31 am by Dr. Roy Schestowitz

By figosdev

GitHub RepositoriesSummary: “This list really is a starting point, which can hopefully increase awareness about the issue of concern.”

THE primary goal of the Delete Github page is an early-warning system for projects that Microsoft has undue control over — they can restrict access, delete projects, or (perhaps in a worst-case scenario) either deliberately modify a project download as SourceForge once did, or simply act a vector for malicious activity — as has already happened to Gentoo and Canonical.

“At this stage there is an effort to be as comprehensive as possible — to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly.”As with higher quality warning systems, including all anti-malware software, false positives are a known possibility. Some care is taken to avoid them, simply to increase the quality of the list itself. At this stage there is an effort to be as comprehensive as possible — to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly.

Admittedly, there is a lower threshold of evidence to get on the list than to be removed. While people may decide to make decisions based on this list, we hope they would double-check our findings first. This is research, it is not law — the list includes IceWM, which I use routinely, and JWM, which I have running as a process as we speak. It includes Leafpad, which I am using to type this very line of text. If we find evidence that WordPress is using Github (and there is a repo for it, I haven’t checked it yet and it isn’t on the list yet) it is unlikely that Roy will decide to remove it from the Techrights servers.

“To some of us, this list is like an endangered species list — we are concerned about the future of projects that are hosted on Github.”If a clerical error places an allergy you don’t have on your medical records, it is very possible that when this error is found, a single line will be drawn through the note. That is the sort of response that will most likely come of discovering errors in the list. A prominent note (perhaps in bold text) is the most likely correction.

To some of us, this list is like an endangered species list — we are concerned about the future of projects that are hosted on Github. And we are most eager to update the status of each one (even possibly Systemd, not necessarily for any wishes of success.)

But how can we know that we are in error? Certainly there is some level of assurance we can be given, most likely we will try to re-evaluate items that are called into question as our priorities allow. But we can check on the first one during the writing of this article:

“We will also probably use this lowercase convention for projects that abandon Github for other repos — if it’s all lowercase, that means we no longer believe the project is relying on Github.”“OpenBSD does not use Github for development at all Roy. They do not even use git!”

“I’m sure there are others in the list that also do not use github.”

Hopefully so. And by no means is Wikipedia a facts-only source of information, but if we simply grep https://en.wikipedia.org/wiki/OpenBSD:

$ leafpad <(wget -O- https://en.wikipedia.org/wiki/OpenBSD | tr ">" "\n" | grep ithub)

We get this:

<a rel="nofollow" class="external text" href="https://github.com/openbsd"

“When this list started, it was a completely manual process.”Now, it’s possible that OpenBSD doesn’t use this repo. If so, the article should be corrected. Let’s increase the quality of this check and visit the actual article: https://en.wikipedia.org/wiki/OpenBSD

Ah, now — one of the not-yet-implemented (though already considered) quality checks is to verify if the Github link is in the summary box (or whatever it’s called, I’m not a Wikipedian) on the top right. Selecting it and right-clicking, “View Selection Source” we find this is probably called an “infobox” and that it begins with:

<table class="infobox vevent"

It completes with "</tbody></table>" which we can use in the future to parse infoboxes.

Doing a further search for our Github link, we find the non-url text of our link is in the “External links” section, titled “Github mirror”.

“There are still checks in place, but in the writing of this article we have found things we can do to refine the process.”What have we learned:

1. The Github link isn’t in the infobox section, which decreases the significance of the link.

2. The official repo is https://cvsweb.openbsd.org/cgi-bin/cvsweb/ this is actually the most pertinent information.

3. The text of the very link that put OpenBSD on our list was labeled “Github mirror” which isn’t necessarily criteria we want or need to add projects to this list.

“Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did.”I would say this warrants a correction, which I will issue now before the article is finished. Since the first letter of every list entry is capitalised, we will make it easier to parse / automatically filter corrected entries by making the title all lowercase: OpenBSD -> openbsd. And… updated. Thanks to arm for setting this straight.

We will also probably use this lowercase convention for projects that abandon Github for other repos — if it’s all lowercase, that means we no longer believe the project is relying on Github.

When this list started, it was a completely manual process. Since then, (only) some aspects have drifted towards rudimentary automation. This has produced a more complete list, with a slightly higher tendency towards false positives. There are still checks in place, but in the writing of this article we have found things we can do to refine the process.

“For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away.”Even after items are added, we can use this new information to double-check the entries (even all at once.) That doesn’t mean this will happen today, but it may ultimately make it easier to discover when projects abandon Github, which would be a bonus. Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did.

For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away. This is the stage where the list is built as large as possible, within reason and utility.

At no point has the process become entirely automated, though I did just parse all of English Wikipedia for Github entries. Wikipedia will not be (is not) our only source, entries do not automatically go from discovery to inclusion (they are checked further, and also chosen manually for relevance, but please feel very free to suggest items you think are worth adding) but we will now try to slightly refine the process of adding new entries, first.

“We also want to encourage projects to leave Github.”We want everybody to realise that Github holds more mainstream projects than many of us thought. We also want to encourage projects to leave Github. We already know that GNOME and KDE mostly have their own repositories, though at the moment it appears that Kate for example, may use Github for something (that is still being looked into.) It’s surprising to discover gnuradio on Github — we are also aware that some of these (like OpenBSD) are just mirrors.

This list really is a starting point, which can hopefully increase awareness about the issue of concern. As awareness increases, we fully expect the quality of this information to be further refined — not unlike the software projects listed here as talented people review them and contribute to the sources.

Thanks again, you’ve helped make this list better than it was when we started.

Licence: Creative Commons CC0 1.0 (public domain)

Share this post: These icons link to social bookmarking sites where readers can share and discover new web pages.
  • Digg
  • del.icio.us
  • Reddit
  • co.mments
  • DZone
  • email
  • Google Bookmarks
  • LinkedIn
  • NewsVine
  • Print
  • Technorati
  • TwitThis
  • Facebook

If you liked this post, consider subscribing to the RSS feed or join us now at the IRC channels.

Pages that cross-reference this one

What Else is New


  1. EPO: Goodbye to the Rule of Law and Hey Hi, AI!

    The EPO’s embrace of buzzwords — no longer a unique EPO strategy (it has already spread elsewhere) — puts examiners in a very bad position and they’re grappling with nerve- and mind-racking dilemmas (risk of unemployment for truly upholding the EPC)



  2. IRC Proceedings: Sunday, January 26, 2020

    IRC logs for Sunday, January 26, 2020



  3. Links 26/1/2020: MuseScore 3.4 Released, New Kate Icon and Solus 4.1 Fortitude Available

    Links for the day



  4. MIT and Microsoft Have Done Nothing to Actually Tackle Pedophilia and Ephebophilia

    MIT never actually resolved the issue that caused Joi Ito, Richard Stallman and others to be ejected; Microsoft meanwhile continues to profit from life-changing abuse (while seeding puff pieces in friendly media, just to pretend otherwise)



  5. Opinion: If You Advocate Population Control and You Are Yourself Doubling in One Single Generation, Then You Might be Hypocritical

    People with 3-5 children (each) tell us that the world has an overpopulation problem; while the growth of the population certainly poses a risk, these people lack the moral authority to lecture us about that (unless they adopt a eugenicist worldview, wherein only particular people are permitted to reproduce)



  6. IRC Proceedings: Saturday, January 25, 2020

    IRC logs for Saturday, January 25, 2020



  7. Nothing Has Truly Changed Since Netscape and Antitrust

    The same old crimes persist, as well as the blatantly anticompetitive behaviour



  8. When the Monopolists and the Patent Litigation Industry Hijack the News They Control the Narrative

    Money buys perception and litigation firms have certainly 'bought' the media coverage, which fails to convey the issue at stake and instead paints a rational court decision as tragedy for "innovation" (by "innovation" they mean monopolies on nature and on life)



  9. Links 25/1/2020: OPNsense 20.1 RC1 and DXVK 1.5.2

    Links for the day



  10. The Linux Kernel is No Longer Free Software?

    Gardiner Bryant, the creator of The Linux Gamer as well as The Off Topical Podcast, reacts to our articles about DRM in Linux (he even pronounced my name correctly)



  11. Sometimes Proprietary Software is Proprietary (Secret) Simply Because It is Not Good and Obfuscation Helps Hide Just How Ugly It Is

    Why nonfree (or proprietary) software generally fails to catch up with Free/libre software — at least on technical grounds — and then makes up for it with marketing and FUD offensives (discrediting perfectly-functioning things, based on their perceived cost)



  12. IRC Proceedings: Friday, January 24, 2020

    IRC logs for Friday, January 24, 2020



  13. Links 24/1/2020: GNU/Linux in Russia and More New Openings

    Links for the day



  14. When EPO Press Coverage Boils Down to Lobbying, Press Releases, EPO Lies, and Bribery

    Any attempts to properly assess and explain what happens in Europe's patent landscape are being drowned out by EPO-bribed and law firms-connected media; to make matters worse, the EPO's bribes have expanded to academia, so even scholarly work in this domain is corrupted by money of special interest groups



  15. IRC Proceedings: Thursday, January 23, 2020

    IRC logs for Thursday, January 23, 2020



  16. Links 23/1/2020: Qubes OS 4.0.3, EasyOS 2.2.5, GhostBSD 20.01

    Links for the day



  17. Passion of the Microsoft

    A rough timeline of Microsoft’s interactions with Linux and the Linux Foundation since 2015



  18. The Patent Microcosm is Really Panicking as European Patents on Life and Other Spurious Junk (Invalid Patents) Are Successfully Rejected

    European Patents (EPs) may be revoked en masse if what we're seeing is the gradual emergence of 'European Mayo' (and maybe soon 'European Alice')



  19. Distractions From Microsoft's Gigantic Tax Evasion and Contribution to Denial of Climate Science

    Microsoft (connected to oil companies) wants us to think of it as a "green" company; not only does it contribute to climate denial but it also evades tax, which is a serious crime that costs tens of billions of dollars (the public pays this money instead)



  20. Confirmation: System1/Startpage Offered Pay to People Who Pushed for (Re)Listing in Privacy Directories

    The debate is now settled; those arguing in favour of listing Startpage as privacy-respecting are in fact secretly 'compensated' by Startpage (in other words, they're Startpage 'shills')



  21. Vandana Shiva: “Bill Gates is Continuing the Work of Monsanto”

    A recent interview on what Bill Gates is really up to in that sham ‘charity’ of his



  22. IRC Proceedings: Wednesday, January 22, 2020

    IRC logs for Wednesday, January 22, 2020



  23. Extending Linux With DRM, Azure and exFAT

    An insufficiently 'conservative' Linux ceases to be freedom-respecting



  24. Linux Foundation (LF) Now Dominated by Lots of Microsoft People and LF Chiefs Join Microsoft in Smearing GPL/Copyleft

    We continue to see additional evidence which serves towards reinforcing our view that the so-called 'Linux' Foundation is actually hostile towards many things that are associated with Linux (unlike those looking to exploit/hijack Linux for proprietary ends)



  25. Links 22/1/2020: Wayland 1.18 Alpha, ODF 1.3 Approved

    Links for the day



  26. IRC Proceedings: Tuesday, January 21, 2020

    IRC logs for Tuesday, January 21, 2020



  27. Poor Excuses for Granting Poor (and Often Illegal/Invalid) Patents

    A quick look at some of the latest examples of software patents advocacy (not by actual software professionals, obviously) and why it's deeply misguided (or guided solely by greedy law firms)



  28. A Simple Plan For a Universal Free Software Community

    "For software to be free as in freedom, we need more people to care personally about software freedom."



  29. Links 21/1/2020: Wine 5.0 and Red Hat Enterprise Linux 8.2 Beta

    Links for the day



  30. Startpage/System1 Almost Definitely Pay for People to Lie About Their Surveillance

    A longterm investigation suggests that there are forces in the debate that aren't objective and are being super evasive and dodgy; this typically happens only when somebody has much to hide


RSS 64x64RSS Feed: subscribe to the RSS feed for regular updates

Home iconSite Wiki: You can improve this site by helping the extension of the site's content

Home iconSite Home: Background about the site and some key features in the front page

Chat iconIRC Channel: Come and chat with us in real time

Recent Posts