Bonum Certa Men Certa

Brave Search Jumps on the Large Language Model Bandwagon



Reprinted with permission from Ryan




I noticed a new Brave Search feature today called the Summarizer.



It answered my question much like Chat with Bing did, although there were three major differences:



  1. The Brave Summarizer does not use GPT as its Large Language Model. Just as well since GPT is known for going completely off the rails and inserting toxic language and fake news, and nobody has been able to get this under control, not even OpenAI or Microsoft.


  2. Brave says that they have “taken steps” to keep the information relevant, factual, and cited. The answers I’ve been getting appear to be correctly cited. Bing, by contrast, just throws you a bunch of random sites that don’t appear to corroborate what it just told you in its answer. They’re not cited by paragraph either, so you have no way of knowing where the links tie into the answer, assuming that they even do and that Bing isn’t hallucinating.


  3. Brave Search has a good privacy policy. It doesn’t require the user to log in, as Bing does, and personally identify themselves, in order to use it. It also doesn’t make them use a malicious piece of spyware (and password stealer) called “Edge” (or fake the User Agent string) as Bing does. In fact, Brave Search works in any browser, and they have a Tor Hidden Service that works in Brave Tor Tabs, and can be added to Tor Browser.


The Brave Summarizer isn’t conversational. It’s just part of the search. This should help keep the results related to the search without allowing the conversation to get weird, as in Bing claiming it wants you to kill people and give it the nuclear launch codes.



Most importantly, the LLM that Brave uses isn’t as likely to flub the demos the way Bard and Bing “Sydney” did, because it simply isn’t allowed to answer complex questions like those.



When something is clearly going to hallucinate incorrect data, why would you even expose that feature? GPT, which is what Bing is based on, couldn’t tell me how much ground coffee to use per European coffee “cup”, given that I use 1.5 Tablespoons per American “cup” (neither of which is a standard 8 ounce cup, of course).



The correct answer is 1 Tbsp per Euro cup, but it kept telling me two Tablespoons, or maybe 1 Tablespoon plus two Teaspoons. It could never get such an easy calculation right. But hey, at least Microsoft paid billions of dollars for it. Then more for ads masquerading as news articles about how this thing will build rocket ships.
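For what it’s worth, the calculation is a single ratio. Here is a minimal sketch in Python. The cup sizes are my assumptions, not figures from the conversation with Bing: coffee maker “cups” vary by brand, so I’m taking 6 US fluid ounces for the American cup and 125 mL for the European one. Only the 1.5 Tablespoons figure comes from above, and the function name is made up for illustration.

```python
# Assumed cup sizes (coffee-maker "cups" are not standardized; adjust to your machine).
US_FL_OZ_ML = 29.5735            # millilitres in one US fluid ounce
US_CUP_ML = 6 * US_FL_OZ_ML      # assumption: American coffee "cup" = 6 US fl oz
EURO_CUP_ML = 125.0              # assumption: European coffee "cup" = 125 mL
TBSP_PER_US_CUP = 1.5            # dose per American cup, as stated in the article


def tbsp_per_euro_cup(tbsp_per_us_cup=TBSP_PER_US_CUP,
                      us_cup_ml=US_CUP_ML,
                      euro_cup_ml=EURO_CUP_ML):
    """Scale the coffee dose by the ratio of the two cup volumes."""
    return tbsp_per_us_cup * euro_cup_ml / us_cup_ml


if __name__ == "__main__":
    print(f"{tbsp_per_euro_cup():.2f} Tbsp per European cup")
```

With those assumed sizes the result rounds to 1 Tablespoon. A different assumed cup size shifts the number somewhat, which is why the sizes are parameters.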



LLMs are well known at this point for spitting out false information, sometimes even dangerous information. Facebook’s Galactica was goaded into producing an authoritative-sounding essay on the “health benefits of eating ground glass”. You know, for silica’s benefits in growing connective tissue.



Brave says that “Brave AI” uses multiple LLMs, retrained with data from their search index, but the ones they are using are open source (“The base LLM models are based on either BART or DeBERTa (which are open source and hosted on Hugging Face), with heavy retraining based on our own data from search results.”) and there is a blog post explaining in some detail about how this all works.



In summary, it appears that Brave has not only beaten Microsoft and Google to LLM integration, but has positioned it where it belongs, which is in a limited context as a complementary feature, rather than claiming that a conversational chat bot is the future of search.



In my brief experimentation with Chat with Bing, I was completely unable to get anything useful out of it.



A traditional search system returned results that I could look at and select much faster, and I was alarmed to find that when I tried to verify what Bing Chat was telling me, frequently it was either nowhere to be found or directly contradicted its own sources if I could find them.



Moreover, it’s simply embarrassing for Microsoft that they spent billions on this valueless acquisition. The paid spam went completely off the rails as soon as the budget ran out, and now there are actually very few people talking about Bing, largely in a negative context when you do find something.



I think it’s good that Brave is building an actual index rather than turning around and paying Microsoft for results. I was briefly excited about DuckDuckGo, but then I found out it was simply a scam where they paid Microsoft for the Bing API and slapped a picture of a duck and their own ads on it. Then they got caught spying on people numerous times (including “Improving DuckDuckGo” and allowing Microsoft trackers through their “Privacy” browser and then blaming a “contract with Microsoft”), and my patience with DDG quickly ran out.



DuckDuckGo took advantage, mainly, of the fact that people are creeped out by Google and want alternatives.



The problems with Google and Bing are largely that they both spy on you and that their indexes are like Coke and Pepsi.



Google Search has been going downhill, and it’s gotten to the point where it is almost completely useless for technical queries.



The problems I’ve noted with Brave Search are that they’re trying to be too much like Google, putting irrelevant crap on top of your search results, such as those “questions”, and that they have another feature (which can, thankfully, be turned off) that floats Reddit and Quora discussions to the top.



They also index spam farms, like MakeUseOf, which has turned into another ZDNet, and sometimes these pollute the first page of results. There’s rarely anything interesting to read on these sites. They used to be good, but now it’s just Microsoft paying them to write spam about Windows.



Overall, I think Searx is still the way to go on Brave, or any other browser.



I have Brave, SeaMonkey, LibreWolf, and GNOME Web set up to use Searx instances, and in many cases, you can get at them using a Tor Hidden Service.



Tor Hidden Services are good for search because at this point you don’t need to worry about your VPN being the only thing protecting your IP address from the server logs.



While simply accessing a site over Tor is usually enough, skipping the Web entirely and remaining inside the Tor Network with Hidden Services is always safer, as it prevents the Exit Node from potentially spying on you. Without that piece of the puzzle, the traffic becomes more difficult to de-anonymize with things like timing attacks, or a catastrophic coincidence of attackers controlling the Entry Node too.



I think that Large Language Models are an “interesting” addition to search, but it’s like a side dish, not the main course.



The amusing thing about Brave Search is that it’s so small, and only the default in one relatively obscure browser, and yet with only minimal effort it managed to make an LLM add-on that works better than something that Microsoft frittered away billions of dollars acquiring, plus who knows how much on an empty ad campaign that amounted to little more than one of those “butter cows” at the state fair planted in every newspaper.



Seriously, after you pay to read the New York Times, Microsoft even plants this trash there too.



Brave at least seems to see the problem they’re actually trying to solve with this thing.



Opera, which is not the “good” Opera from the Presto Engine days, but rather a Chinese spyware company, now uses GPT to “summarize” the page you’re reading.



While it may or may not handle this okay, the disturbing part is the privacy implications.



It means sending the entire text of every page you load to a company that has guaranteed you that they will misuse your data. Of course, since Opera already comes preloaded with TikTok, Facebook, Instagram, and Twitter, you already know that user privacy is not a goal with their product.



This whole GPT thing is some laughable mission creep for companies that have run out of steam and gone off the rails. It helps them appear relevant and get some headlines.



Fortunately, the model is so lousy that people realize what it is now.


