Bonum Certa Men Certa

Brave Search Jumps on the Large Language Model Bandwagon



Reprinted with permission from Ryan

I noticed a new Brave Search feature today called the Summarizer.



It answered my question much like Chat with Bing did, although there were three major differences:



  1. The Brave Summarizer does not use GPT as its Large Language Model. Just as well since GPT is known for going completely off the rails and inserting toxic language and fake news, and nobody has been able to get this under control, not even OpenAI or Microsoft.


  2. Brave says that they have “taken steps” to keep the information relevant, factual, and cited. The answers I’ve been getting appear to be correctly cited. Bing, by contrast, just throws you a bunch of random sites that don’t appear to corroborate what it just told you in its answer. They’re not cited by paragraph either, so you have no way of knowing where the links tie into the answer, assuming that they even do and that Bing isn’t hallucinating.


  3. Brave Search has a good privacy policy. It doesn’t require the user to log in, as Bing does, and personally identify themselves, in order to use it. It also doesn’t make them use a malicious piece of spyware (and password stealer) called “Edge” (or fake the User Agent string) as Bing does. In fact, Brave Search works in any browser, and they have a Tor Hidden Service that works in Brave Tor Tabs, and can be added to Tor Browser.


The Brave Summarizer isn’t conversational. It’s just part of the search. This should help keep the results related to the search without allowing the conversation to get weird, like Bing claiming it wants you to kill people and give it the nuclear launch codes type weird.



Most importantly, the LLM that Brave uses isn’t as likely to flub the demos like Bard and Bing “Sydney” because it just simply isn’t allowed to answer complex questions like these.



When something is clearly going to hallucinate incorrect data, why would you even expose that feature? GPT, which is what Bing is based on, couldn’t tell me how to convert European coffee “cups” to American “cups” (neither of which is a standard 8 ounce cup, of course) and use 1.5 Tablespoons of ground coffee per American cup.



The correct answer is 1 Tbsp per Euro cup, but it kept telling me two Tablespoons, or maybe 1 Tablespoon plus two Teaspoons. It could never get such an easy calculation right. But hey, at least Microsoft paid billions of dollars for it. Then more for ads masquerading as news articles about how this thing will build rocket ships.
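For anyone who wants to check the arithmetic, here is a rough sketch of the calculation. The cup sizes are my assumptions, since coffee maker “cups” vary: I’m taking the American coffee “cup” as 6 US fluid ounces and the European one as 125 ml. The function name is made up for illustration, too.

```python
# Assumed sizes (not stated above): coffee-maker "cups", not 8 oz measuring cups.
US_FL_OZ_ML = 29.5735                  # millilitres per US fluid ounce
US_COFFEE_CUP_ML = 6 * US_FL_OZ_ML     # assumption: 6 fl oz American coffee "cup"
EURO_COFFEE_CUP_ML = 125.0             # assumption: 125 ml European coffee "cup"


def tbsp_per_cup(tbsp_per_reference_cup, reference_cup_ml, target_cup_ml):
    """Scale a coffee dose linearly with cup volume."""
    return tbsp_per_reference_cup * target_cup_ml / reference_cup_ml


# 1.5 Tbsp per American cup, rescaled to the smaller European cup.
dose = tbsp_per_cup(1.5, US_COFFEE_CUP_ML, EURO_COFFEE_CUP_ML)
print(f"{dose:.2f} Tbsp per European cup")  # prints "1.06 Tbsp per European cup"
```

Under those assumptions it comes out to roughly 1 Tablespoon per Euro cup. Different cup sizes would shift the number a little, but nowhere near two Tablespoons.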



LLMs are well known at this point for spitting out false information, sometimes even dangerous information. Facebook’s Galactica was goaded into producing an authoritative-sounding essay on the “health benefits of eating ground glass”. You know, for silica’s benefits in growing connective tissue.



Brave says that “Brave AI” uses multiple LLMs, retrained with data from their search index, but the ones they are using are open source (“The base LLM models are based on either BART or DeBERTa (which are open source and hosted on Hugging Face), with heavy retraining based on our own data from search results.”) and there is a blog post explaining in some detail how this all works.



In summary, it appears that Brave has not only beaten Microsoft and Google to LLM integration, but has positioned it where it belongs, which is in a limited context as a complementary feature, rather than claiming that a conversational chat bot is the future of search.



In my brief experimentation with Chat with Bing, I was completely unable to get anything useful out of it.



A traditional search system returned results that I could look at and select much faster, and I was alarmed to find that when I tried to verify what Bing Chat was telling me, frequently the information was either nowhere to be found or directly contradicted by its own sources, if I could find them.



Moreover, it’s simply embarrassing for Microsoft that they spent billions on this valueless acquisition. The paid spam went completely off the rails as soon as the budget ran out, and now there are actually very few people talking about Bing, and largely in a negative context when you do find something.



I think it’s good that Brave is building an actual index rather than turning around and paying Microsoft for results. I was briefly excited about DuckDuckGo, but when I found out it was simply a scam where they paid Microsoft for Bing API and then slapped a picture of a duck and their own ads on it, and then got caught spying on people numerous times (including Improving DuckDuckGo and allowing Microsoft trackers through their “Privacy” browser and then blaming a “contract with Microsoft”), my patience with DDG quickly ran out.



DuckDuckGo took advantage, mainly, of the fact that people are creeped out by Google and want alternatives.



The problems with Google and Bing are largely that they both spy on you and their index is like Coke and Pepsi.



Google Search has been going downhill, and it’s gotten to the point where the results for technical queries are almost completely useless.



The problems I’ve noted with Brave Search are that they’re trying to be too much like Google, putting irrelevant crap on top of your search results, such as those “questions”, and that they have another feature (which can, thankfully, be turned off) which floats Reddit and Quora discussions to the top.



They also index spam farms, like MakeUseOf, which has turned into another ZDNet, and sometimes these pollute the first page of results. There’s rarely anything interesting to read on these sites. They used to be good, but now it’s just Microsoft paying them to write spam about Windows.



Overall, I think Searx is still the way to go on Brave, or any other browser.



I have Brave, SeaMonkey, LibreWolf, and GNOME Web set up to use Searx instances, and in many cases, you can get at them using a Tor Hidden Service.



Tor Hidden Services are good for search because at this point you don’t need to worry about your VPN being the only thing protecting your IP address from the server logs.



While simply accessing a site over Tor is usually enough, skipping the Web entirely and remaining inside the Tor Network with Hidden Services is always safer, as it prevents the Exit Node from potentially spying on you. Without that piece of the puzzle, the traffic becomes more difficult to de-anonymize with things like timing attacks, or a catastrophic coincidence of attackers controlling the Entry Node too.



I think that Large Language Models are an “interesting” addition to search, but it’s like a side dish, not the main course.



The amusing thing about Brave Search is that it’s so small, and only the default in one relatively obscure browser, and yet with only minimal effort it managed to make an LLM add-on that works better than something that Microsoft frittered away billions of dollars acquiring, plus who knows how much on an empty ad campaign that amounted to little more than one of those “butter cows” at the state fair planted in every newspaper.



Seriously, after you pay to read the New York Times, Microsoft even plants this trash there too.



Brave at least seems to see the problem they’re actually trying to solve with this thing.



Opera, which is not the “good” Opera from the Presto Engine days, but rather a Chinese spyware company, now uses GPT to “summarize” the page you’re reading.



While it may or may not handle this okay, the disturbing part is the privacy implications.



It means sending the entire text of every page you load to a company that has guaranteed you that they will misuse your data. Of course, since Opera already comes preloaded with TikTok, Facebook, Instagram, and Twitter, you already know that user privacy is not a goal with their product.



This whole GPT thing is some laughable mission creep for companies that have run out of steam and gone off the rails. It helps them appear relevant and get some headlines.



Fortunately, the model is so lousy that people realize what it is now.


