
11.12.19

Maintaining the ‘Delete Github’ page

Posted in Microsoft, Site News at 11:31 am by Dr. Roy Schestowitz

By figosdev

Summary: “This list really is a starting point, which can hopefully increase awareness about the issue of concern.”

THE primary goal of the Delete Github page is to serve as an early-warning system for projects over which Microsoft has undue control. Microsoft can restrict access, delete projects, or (perhaps in a worst-case scenario) either deliberately modify a project download, as SourceForge once did, or simply act as a vector for malicious activity, as has already happened to Gentoo and Canonical.

As with higher quality warning systems, including all anti-malware software, false positives are a known possibility. Some care is taken to avoid them, simply to increase the quality of the list itself. At this stage there is an effort to be as comprehensive as possible: to include as many mainstream Free software projects as we can, to give people a better idea of how many projects are potentially threatened by a monopoly.

Admittedly, there is a lower threshold of evidence to get on the list than to be removed. While people may make decisions based on this list, we hope they would double-check our findings first. This is research, not law: the list includes IceWM, which I use routinely, and JWM, which I have running as a process as we speak. It includes Leafpad, which I am using to type this very line of text. If we find evidence that WordPress is using Github (there is a repo for it, but I haven’t checked it yet and it isn’t on the list yet), it is unlikely that Roy will decide to remove it from the Techrights servers.

If a clerical error places an allergy you don’t have on your medical records, it is very possible that when this error is found, a single line will be drawn through the note. That is the sort of response that will most likely come of discovering errors in the list. A prominent note (perhaps in bold text) is the most likely correction.

To some of us, this list is like an endangered species list — we are concerned about the future of projects that are hosted on Github. And we are most eager to update the status of each one (even possibly Systemd, not necessarily for any wishes of success.)

But how can we know that we are in error? Certainly there is some level of assurance we can be given; most likely we will try to re-evaluate items that are called into question as our priorities allow. But we can check on the first one during the writing of this article:

“OpenBSD does not use Github for development at all Roy. They do not even use git!”

“I’m sure there are others in the list that also do not use github.”

Hopefully so. And by no means is Wikipedia a facts-only source of information, but if we simply grep https://en.wikipedia.org/wiki/OpenBSD:

$ leafpad <(wget -O- https://en.wikipedia.org/wiki/OpenBSD | tr ">" "\n" | grep ithub)

We get this:

<a rel="nofollow" class="external text" href="https://github.com/openbsd"
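As a rough sketch of how this check could be tightened, the pipeline can be reduced to printing only the Github URLs themselves. The function name `github_links` is hypothetical, and the pattern assumes links are written as double-quoted `href` attributes, which is what the output above suggests but which we have not verified for every page.

```shell
# Print each distinct github.com URL found in href attributes of HTML read from stdin.
# github_links is a hypothetical helper name, not part of any existing tool.
github_links() {
    # -E: extended regex, -i: ignore case, -o: print only the matching part
    grep -Eio 'href="https?://(www\.)?github\.com[^"]*"' |
        # drop everything up to the opening quote, then the closing quote
        sed -e 's/^[^"]*"//' -e 's/"$//' |
        sort -u
}

# Usage (network access assumed):
#   wget -qO- https://en.wikipedia.org/wiki/OpenBSD | github_links
```

This only says that a link exists somewhere on the page; it says nothing about where the link sits or what it is labelled.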

Now, it’s possible that OpenBSD doesn’t use this repo. If so, the article should be corrected. Let’s increase the quality of this check and visit the actual article: https://en.wikipedia.org/wiki/OpenBSD

Ah, now — one of the not-yet-implemented (though already considered) quality checks is to verify if the Github link is in the summary box (or whatever it’s called, I’m not a Wikipedian) on the top right. Selecting it and right-clicking, “View Selection Source” we find this is probably called an “infobox” and that it begins with:

<table class="infobox vevent"

It completes with "</tbody></table>" which we can use in the future to parse infoboxes.
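A minimal sketch of what such an infobox check might look like, using the same `tr` trick as the command earlier in this article. The names `infobox_only` and `infobox_has_github` are made up for illustration. The sketch assumes the infobox opens with `<table class="infobox` and contains no nested tables, so it stops at the first `</table`; that will not hold for every article.

```shell
# Print the infobox portion of a Wikipedia page read from stdin, one tag per line.
# Assumption: the infobox opens with <table class="infobox and has no nested table.
infobox_only() {
    # tr puts each tag on its own line (and drops the closing ">")
    tr '>' '\n' |
        awk '/<table class="infobox/ { inside = 1 }
             inside { print }
             inside && /<\/table/ { inside = 0 }'
}

# Succeed (exit status 0) only if a github.com link appears inside the infobox.
infobox_has_github() {
    infobox_only | grep -Eqi 'href="https?://(www\.)?github\.com'
}

# Usage (network access assumed):
#   wget -qO- https://en.wikipedia.org/wiki/OpenBSD | infobox_has_github && echo "in infobox"
```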

Doing a further search for our Github link, we find the non-url text of our link is in the “External links” section, titled “Github mirror”.

What have we learned:

1. The Github link isn’t in the infobox section, which decreases the significance of the link.

2. The official repo is https://cvsweb.openbsd.org/cgi-bin/cvsweb/ and this is actually the most pertinent information.

3. The very link that put OpenBSD on our list was labeled “Github mirror”, which isn’t necessarily a criterion we want or need for adding projects to this list.
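The third point suggests looking at the text of a link as well as its address. A hedged sketch follows; `github_link_labels` is a hypothetical name, and the approach assumes the visible text comes directly after the opening `<a ...>` tag, so a label wrapped in further markup would come out empty.

```shell
# Print the visible text of every github.com link in HTML read from stdin.
# github_link_labels is a hypothetical helper name.
github_link_labels() {
    # After tr, the link text is at the start of the line following the <a ...> tag.
    tr '>' '\n' |
        awk 'want { sub(/<.*/, ""); print; want = 0 }
             tolower($0) ~ /href="https?:\/\/(www\.)?github\.com/ { want = 1 }'
}

# Usage (network access assumed): list labels that call the link a mirror
#   wget -qO- https://en.wikipedia.org/wiki/OpenBSD | github_link_labels | grep -i mirror
```

A label containing “mirror” would then be a reason to look more closely before adding a project, not proof either way.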

I would say this warrants a correction, which I will issue now before the article is finished. Since the first letter of every list entry is capitalised, we will make it easier to parse / automatically filter corrected entries by making the title all lowercase: OpenBSD -> openbsd. And… updated. Thanks to arm for setting this straight.

We will also probably use this lowercase convention for projects that abandon Github for other repos — if it’s all lowercase, that means we no longer believe the project is relying on Github.
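To illustrate why this convention is easy to filter on, here is a small sketch. It assumes the list is available as plain text with one project title per line, which is an assumption about the format and not a description of how the page is actually stored. The function names are hypothetical.

```shell
# Entries with no uppercase letter at all: corrected, or no longer relying on Github.
corrected_entries() {
    awk 'NF && $0 == tolower($0)'
}

# Entries that still contain an uppercase letter: still believed to rely on Github.
active_entries() {
    awk 'NF && $0 != tolower($0)'
}

# Usage, with a hypothetical file name:
#   corrected_entries < delete-github-list.txt
```

One caveat: a title with no letters to capitalise in the first place (one made only of digits, for example) would be treated as corrected, so such entries would need to be checked by hand.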

When this list started, it was a completely manual process. Since then, (only) some aspects have drifted towards rudimentary automation. This has produced a more complete list, with a slightly higher tendency towards false positives. There are still checks in place, but in the writing of this article we have found things we can do to refine the process.

Even after items are added, we can use this new information to double-check the entries (even all at once.) That doesn’t mean this will happen today, but it may ultimately make it easier to discover when projects abandon Github, which would be a bonus. Obviously people who are more intimately familiar with these projects are encouraged to help us, as arm did.

For the moment, the highest priority is still adding to the list, so we may not double check every existing entry right away. This is the stage where the list is built as large as possible, within reason and utility.

At no point has the process become entirely automated, though I did just parse all of English Wikipedia for Github entries. Wikipedia will not be (is not) our only source, and entries do not automatically go from discovery to inclusion: they are checked further, and also chosen manually for relevance (please feel very free to suggest items you think are worth adding). But first we will now try to slightly refine the process of adding new entries.

We want everybody to realise that Github holds more mainstream projects than many of us thought. We also want to encourage projects to leave Github. We already know that GNOME and KDE mostly have their own repositories, though at the moment it appears that Kate, for example, may use Github for something (that is still being looked into). It’s surprising to discover gnuradio on Github, and we are also aware that some of these (like OpenBSD) are just mirrors.

This list really is a starting point, which can hopefully increase awareness about the issue of concern. As awareness increases, we fully expect the quality of this information to be further refined — not unlike the software projects listed here as talented people review them and contribute to the sources.

Thanks again, you’ve helped make this list better than it was when we started.

Licence: Creative Commons CC0 1.0 (public domain)
