Bonum Certa Men Certa

Brave Search Jumps on the Large Language Model Bandwagon



Reprinted with permission from Ryan




I noticed a new Brave Search feature today called the Summarizer.



It answered my question much like Chat with Bing did, although there were three major differences:



  1. The Brave Summarizer does not use GPT as its Large Language Model. That’s just as well, since GPT is known for going completely off the rails and inserting toxic language and fake news, and nobody has been able to get this under control, not even OpenAI or Microsoft.


  2. Brave says that they have “taken steps” to keep the information relevant, factual, and cited. The answers I’ve been getting appear to be correctly cited, whereas Bing just throws you a bunch of random sites that don’t appear to corroborate what it just told you in its answer. Bing’s links aren’t cited by paragraph either, so you have no way of knowing where they tie into the answer, assuming that they even do and that Bing isn’t hallucinating.


  3. Brave Search has a good privacy policy. It doesn’t require the user to log in, as Bing does, and personally identify themselves, in order to use it. It also doesn’t make them use a malicious piece of spyware (and password stealer) called “Edge” (or fake the User Agent string) as Bing does. In fact, Brave Search works in any browser, and they have a Tor Hidden Service that works in Brave Tor Tabs, and can be added to Tor Browser.


The Brave Summarizer isn’t conversational. It’s just part of the search. This should help keep the results related to the search without allowing the conversation to get weird, as in Bing claiming it wants you to kill people and give it the nuclear launch codes.



Most importantly, the LLM that Brave uses isn’t as likely to flub the demos like Bard and Bing “Sydney” did, because it simply isn’t allowed to answer complex questions like those.



When something is clearly going to hallucinate incorrect data, why would you even expose that feature? GPT, which is what Bing is based on, couldn’t tell me how to convert European coffee “cups” to American “cups” (neither of which is a standard 8 ounce cup, of course) when you use 1.5 Tablespoons of ground coffee per American cup.



The correct answer is 1 Tbsp per Euro cup, but it kept telling me two Tablespoons, or maybe 1 Tablespoon plus two Teaspoons. It could never get such an easy calculation right. But hey, at least Microsoft paid billions of dollars for it. Then more for ads masquerading as news articles about how this thing will build rocket ships.
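
The calculation is just a proportion: the coffee dose scales with the volume of the cup. Here is a minimal sketch in Python. The two cup sizes (6 US fl oz for the American coffee-maker cup, 4 US fl oz for the Euro cup) are assumptions chosen for illustration and are not stated above, and real coffee makers vary. Only the 1.5 Tablespoons per American cup comes from the text.

```python
from fractions import Fraction

# ASSUMED cup sizes (not given in the article; coffee-maker "cups" vary).
AMERICAN_CUP_FL_OZ = Fraction(6)  # assumption: 6 US fl oz per American "cup"
EURO_CUP_FL_OZ = Fraction(4)      # assumption: 4 US fl oz per Euro "cup"

# From the article: 1.5 Tablespoons of ground coffee per American cup.
TBSP_PER_AMERICAN_CUP = Fraction(3, 2)


def tbsp_per_cup(cup_fl_oz,
                 ref_cup_fl_oz=AMERICAN_CUP_FL_OZ,
                 ref_tbsp=TBSP_PER_AMERICAN_CUP):
    """Scale the reference coffee dose in proportion to the cup volume."""
    return ref_tbsp * Fraction(cup_fl_oz) / Fraction(ref_cup_fl_oz)


if __name__ == "__main__":
    dose = tbsp_per_cup(EURO_CUP_FL_OZ)
    print(f"{float(dose):g} Tbsp per Euro cup")
```

Under those assumed sizes the Euro cup holds two thirds of the American cup, so the dose comes out to two thirds of 1.5 Tablespoons, which is 1 Tablespoon.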



LLMs are well known at this point for spitting out false information, sometimes even dangerous information. Facebook’s Galactica was goaded into producing an authoritative-sounding essay on the “health benefits of eating ground glass”. You know, for silica’s benefits in growing connective tissue.



Brave says that “Brave AI” uses multiple LLMs, retrained with data from their search index, and that the ones they are using are open source (“The base LLM models are based on either BART or DeBERTa (which are open source and hosted on Hugging Face), with heavy retraining based on our own data from search results.”). There is also a blog post explaining in some detail how this all works.



In summary, it appears that Brave has not only beaten Microsoft and Google to LLM integration, but has positioned it where it belongs, which is in a limited context as a complementary feature, rather than claiming that a conversational chat bot is the future of search.



In my brief experimentation with Chat with Bing, I was completely unable to get anything useful out of it.



A traditional search system returned results that I could look at and select much faster, and I was alarmed to find that when I tried to verify what Bing Chat was telling me, frequently the information was either nowhere to be found or directly contradicted by its own sources, if I could find them.



Moreover, it’s simply embarrassing for Microsoft that they spent billions on this valueless acquisition. The paid spam went completely off the rails as soon as the budget ran out, and now there are actually very few people talking about Bing, largely in a negative context when you do find something.



I think it’s good that Brave is building an actual index rather than turning around and paying Microsoft for results. I was briefly excited about DuckDuckGo, but when I found out it was simply a scam where they paid Microsoft for the Bing API and then slapped a picture of a duck and their own ads on it, and then got caught spying on people numerous times (including “Improving DuckDuckGo” and allowing Microsoft trackers through their “Privacy” browser and then blaming a “contract with Microsoft”), my patience with DDG quickly ran out.



DuckDuckGo took advantage, mainly, of the fact that people are creeped out by Google and want alternatives.



The problems with Google and Bing are largely that they both spy on you and that their indexes are like Coke and Pepsi.



Google Search has been going downhill, and it’s gotten to the point where its results for technical queries are almost completely useless.



The problem I’ve noted with Brave Search is that they’re trying to be too much like Google, putting irrelevant crap on top of your search results, such as those “questions”, and they have another feature (which can, thankfully, be turned off) which floats Reddit and Quora discussions to the top.



They also index spam farms, like MakeUseOf, which has turned into another ZDNet, and sometimes these pollute the first page of results. There’s rarely anything interesting to read on these sites. They used to be good, but now it’s just Microsoft paying them to write spam about Windows.



Overall, I think Searx is still the way to go on Brave, or any other browser.



I have Brave, SeaMonkey, LibreWolf, and GNOME Web set up to use Searx instances, and in many cases, you can get at them using a Tor Hidden Service.



Tor Hidden Services are good for search because at this point you don’t need to worry about your VPN being the only thing protecting your IP address from the server logs.



While simply accessing a site over Tor is usually enough, skipping the Web entirely and remaining inside the Tor Network with Hidden Services is always safer, as it prevents the Exit Node from potentially spying on you. With no Exit Node in the picture, the traffic becomes more difficult to de-anonymize with things like timing attacks, or through a catastrophic coincidence of attackers controlling the Entry Node too.



I think that Large Language Models are an “interesting” addition to search, but it’s like a side dish, not the main course.



The amusing thing about Brave Search is that it’s so small, and only the default in one relatively obscure browser, yet with only minimal effort it managed to make an LLM add-on that works better than something that Microsoft frittered away billions of dollars acquiring, plus who knows how much on an empty ad campaign that amounted to little more than one of those “butter cows” at the state fair planted in every newspaper.



Seriously, after you pay to read the New York Times, Microsoft even plants this trash there too.



Brave at least seems to see the problem they’re actually trying to solve with this thing.



Opera, which is not the “good” Opera from the Presto Engine days, but rather a Chinese spyware company, now uses GPT to “summarize” the page you’re reading.



While it may or may not handle this okay, the disturbing part is the privacy implications.



It means sending the entire text of every page you load to a company that has guaranteed you that they will misuse your data. Of course, since Opera already comes preloaded with TikTok, Facebook, Instagram, and Twitter, you already know that user privacy is not a goal with their product.



This whole GPT thing is some laughable mission creep for companies that have run out of steam and gone off the rails. It helps them appear relevant and get some headlines.



Fortunately, the model is so lousy that people realize what it is now.


