Bonum Certa Men Certa

Brave Search Jumps on the Large Language Model Bandwagon



Reprinted with permission from Ryan




I noticed a new Brave Search feature today called the Summarizer.



It answered my question much like Chat with Bing did, although there were three major differences:



  1. The Brave Summarizer does not use GPT as its Large Language Model. Just as well since GPT is known for going completely off the rails and inserting toxic language and fake news, and nobody has been able to get this under control, not even OpenAI or Microsoft.


  2. Brave says that they have “taken steps” to keep the information relevant, factual, and cited. The answers I’ve been getting appear to be correctly cited, whereas Bing just throws you a bunch of random sites that don’t appear to corroborate what it just told you in its answer. Bing’s links aren’t cited by paragraph either, so you have no way of knowing where they tie into the answer, assuming that they even do and that Bing isn’t hallucinating.


  3. Brave Search has a good privacy policy. It doesn’t require the user to log in, as Bing does, and personally identify themselves, in order to use it. It also doesn’t make them use a malicious piece of spyware (and password stealer) called “Edge” (or fake the User Agent string) as Bing does. In fact, Brave Search works in any browser, and they have a Tor Hidden Service that works in Brave Tor Tabs, and can be added to Tor Browser.


The Brave Summarizer isn’t conversational. It’s just part of the search. This should help keep the results related to the search without allowing the conversation to get weird, as in Bing claiming it wants you to kill people and give it the nuclear launch codes.



Most importantly, the LLM that Brave uses isn’t as likely to flub the demos like Bard and Bing “Sydney” because it just simply isn’t allowed to answer complex questions like these.



When something is clearly going to hallucinate incorrect data, why would you even expose that feature? GPT, which is what Bing is based on, couldn’t tell me how to convert European coffee “cups” to American “cups” (neither of which is a standard 8 ounce cup, of course), given that I use 1.5 Tablespoons of ground coffee per American cup.



The correct answer is 1 Tbsp per Euro cup, but it kept telling me two Tablespoons, or maybe 1 Tablespoon plus two Teaspoons. It could never get such an easy calculation right. But hey, at least Microsoft paid billions of dollars for it. Then more for ads masquerading as news articles about how this thing will build rocket ships.
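The conversion above is a single proportion, and it can be sketched in a few lines. The cup sizes here are assumptions, since the article does not state them: a European coffee-maker “cup” is taken as 125 ml and an American coffee-maker “cup” as 6 US fluid ounces. The function name `tbsp_per_euro_cup` is made up for illustration. Only the 1.5 Tablespoons per American cup comes from the text.

```python
# Assumed sizes (NOT from the article): coffee-maker "cups",
# neither of which is a standard 8 ounce measuring cup.
ML_PER_US_FL_OZ = 29.5735
EURO_CUP_ML = 125.0                  # assumption: European coffee "cup"
US_CUP_ML = 6 * ML_PER_US_FL_OZ      # assumption: American coffee "cup", about 177 ml
TBSP_PER_US_CUP = 1.5                # from the article


def tbsp_per_euro_cup(tbsp_per_us_cup=TBSP_PER_US_CUP,
                      euro_cup_ml=EURO_CUP_ML,
                      us_cup_ml=US_CUP_ML):
    """Scale the coffee dose by the ratio of the two cup volumes."""
    return tbsp_per_us_cup * euro_cup_ml / us_cup_ml


dose = tbsp_per_euro_cup()
print(f"{dose:.2f} Tbsp per European cup")  # prints "1.06 Tbsp per European cup"
```

Under those assumed sizes the result rounds to 1 Tablespoon per European cup, which matches the answer given above. A different assumed cup size would shift the number, but it stays a one-step calculation.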



LLMs are well known at this point for spitting out false information, sometimes even dangerous information. Facebook’s Galactica was goaded into producing an authoritative-sounding essay on the “health benefits of eating ground glass”. You know, for silica’s benefits in growing connective tissue.



Brave says that “Brave AI” uses multiple LLMs, retrained with data from their search index, and that the ones they are using are open source (“The base LLM models are based on either BART or DeBERTa (which are open source and hosted on Hugging Face), with heavy retraining based on our own data from search results.”). There is also a blog post explaining in some detail how this all works.



In summary, it appears that Brave has not only beaten Microsoft and Google to LLM integration, but has positioned it where it belongs, which is in a limited context as a complementary feature, rather than claiming that a conversational chat bot is the future of search.



In my brief experimentation with Chat with Bing, I was completely unable to get anything useful out of it.



A traditional search system returned results that I could look at and select much faster, and I was alarmed to find that when I tried to verify what Bing Chat was telling me, frequently it was either nowhere to be found or directly contradicted its own sources if I could find them.



Moreover, it’s simply embarrassing for Microsoft that they spent billions on this valueless acquisition. The paid spam went completely off the rails as soon as the budget ran out, and now there are actually very few people talking about Bing, and largely in a negative context when you do find something.



I think it’s good that Brave is building an actual index rather than turning around and paying Microsoft for results. I was briefly excited about DuckDuckGo, but when I found out it was simply a scam where they paid Microsoft for Bing API and then slapped a picture of a duck and their own ads on it, and then got caught spying on people numerous times (including Improving DuckDuckGo and allowing Microsoft trackers through their “Privacy” browser and then blaming a “contract with Microsoft”), my patience with DDG quickly ran out.



DuckDuckGo took advantage, mainly, of the fact that people are creeped out by Google and want alternatives.



The problems with Google and Bing are largely that they both spy on you and that their indexes are like Coke and Pepsi.



Google Search has been going downhill, and it’s gotten to the point where the results for technical queries are almost completely useless.



The problem I’ve noted with Brave Search is that they’re trying to be too much like Google, putting irrelevant crap on top of your search results, such as those “questions”, and they have another feature (which can, thankfully, be turned off) that floats Reddit and Quora discussions to the top.



They also index spam farms, like MakeUseOf, which has turned into another ZDNet, and sometimes these pollute the first page of results. There’s rarely anything interesting to read on these sites. They used to be good, but now it’s just Microsoft paying them to write spam about Windows.



Overall, I think Searx is still the way to go on Brave, or any other browser.



I have Brave, SeaMonkey, LibreWolf, and GNOME Web set up to use Searx instances, and in many cases, you can get at them using a Tor Hidden Service.



Tor Hidden Services are good for search because at this point you don’t need to worry about your VPN being the only thing protecting your IP address from the server logs.



While simply accessing a site over Tor is usually enough, skipping the Web entirely and remaining inside the Tor Network with Hidden Services is always safer, as it prevents the Exit Node from potentially spying on you. Without that piece of the puzzle, the traffic becomes more difficult to de-anonymize with things like timing attacks, or a catastrophic coincidence of attackers controlling the Entry Node too.



I think that Large Language Models are an “interesting” addition to search, but it’s like a side dish, not the main course.



The amusing thing about Brave Search is that it’s so small, and the default in only one relatively obscure browser, yet with only minimal effort it managed to make an LLM add-on that works better than something Microsoft frittered away billions of dollars acquiring, plus who knows how much on an empty ad campaign that amounted to little more than one of those “butter cows” at the state fair planted in every newspaper.



Seriously, after you pay to read the New York Times, Microsoft even plants this trash there too.



Brave at least seems to see the problem they’re actually trying to solve with this thing.



Opera, which is not the “good” Opera from the Presto Engine days, but rather a Chinese spyware company, now uses GPT to “summarize” the page you’re reading.



While it may or may not handle this okay, the disturbing part is the privacy implications.



It means sending the entire text of every page you load to a company that has guaranteed you that they will misuse your data. Of course, since Opera already comes preloaded with TikTok, Facebook, Instagram, and Twitter, you already know that user privacy is not a goal with their product.



This whole GPT thing is some laughable mission creep for companies that have run out of steam and gone off the rails. It helps them appear relevant and get some headlines.



Fortunately, the model is so lousy that people realize what it is now.


