
Hello and welcome to The Automated, your AI tour guide.
Grok 3, which was expected to be a game-changer, is now embroiled in controversy, from accusations of manipulated benchmarks to awkward AI responses and a growing list of unanswered questions.
Let's just say things are quickly escalating into a real mess over there.
Here's what we have for you today:
xAI Got Caught Inflating Grok 3's Scores?
OpenAI Expands Its AI Agent, Operator, to More Countries.
How to make generative AI a partner in your daily workflow.
How to use NotebookLM to boost my productivity.
ChatGPT Prompt Of The Day: Email newsletters.

xAI Got Caught Inflating Grok 3's Scores?

Elon Musk's AI company, xAI, is neck-deep in AI drama.
Their latest model, Grok 3, was supposed to be a big win, but instead it's landed them in a controversy sandwich.
First, OpenAI employees accused xAI of fudging benchmark results to make Grok 3 look better than it actually is.
Then, Grok 3 went off the rails, suggesting both Trump and Musk deserved the death penalty.
And to top it off? It briefly censored unflattering mentions of them.
Let's start with the benchmark fiasco.
When xAI released Grok 3, they proudly posted a graph showing it outperforming OpenAI's best available model, o3-mini-high, on a math test called AIME 2025.
But there was a catch: xAI's graph conveniently left out o3-mini-high's score on a key metric called "cons@64," which gives AI models 64 attempts to solve a problem and selects the most common answer.
With that metric included, OpenAI's model actually performed better.
In short, xAI's graph was like bragging about winning a race without mentioning the other guy was running uphill.
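For the curious, here is roughly how a consensus score differs from a single-attempt score. This is a minimal Python sketch, not xAI's or OpenAI's actual evaluation code. The function names and the 64 sample answers are made up for illustration.

```python
from collections import Counter

def consensus_answer(samples):
    """Return the most common answer among the sampled attempts (majority vote)."""
    return Counter(samples).most_common(1)[0][0]

def single_attempt_accuracy(samples, correct):
    """Fraction of individual attempts that match the correct answer."""
    return sum(s == correct for s in samples) / len(samples)

def cons_at_k(samples, correct):
    """1.0 if the majority-vote answer is correct, else 0.0."""
    return 1.0 if consensus_answer(samples) == correct else 0.0

# Hypothetical data: 64 attempts at one problem whose correct answer is "204".
attempts = ["204"] * 30 + ["210"] * 20 + ["96"] * 14

print(single_attempt_accuracy(attempts, "204"))  # 0.46875
print(cons_at_k(attempts, "204"))                # 1.0
```

On this made-up problem, fewer than half of the individual attempts are right, yet the majority vote still lands on the correct answer. That is why a cons@64 bar can sit well above a single-attempt bar for the same model, and why leaving it off one side of a chart changes the picture.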
xAI's co-founder, Igor Babuschkin, defended the results, arguing that OpenAI has pulled similar moves before.
But then a neutral third party stepped in and posted a more accurate graph. Surprise, surprise: it told a very different story.
Hilarious how some people see my plot as attack on OpenAI and others as attack on Grok while in reality it's DeepSeek propaganda
(I actually believe Grok looks good there, and openAI's TTC chicanery behind o3-mini-*high*-pass@"""1""" deserves more scrutiny.)
- Teortaxes (@teortaxesTex)
11:23 AM โข Feb 20, 2025
Then, as if xAI didn't have enough on its plate, Grok 3 went rogue, handing out death penalty suggestions like a malfunctioning dystopian judge.
When asked who in the U.S. most deserved capital punishment, it first named Jeffrey Epstein. But when reminded Epstein was dead, it pivoted to Donald Trump.
Jesus Christ dude, what did Musk create lol
- Hunter (@StatisticUrban)
5:16 PM โข Feb 21, 2025
And when the question was tweaked slightly? It dropped another bombshell: Elon Musk. Yikes.
Naturally, xAI scrambled to contain the damage, calling it a "really terrible and bad failure."
Grok has since been reprogrammed to dodge such questions, now responding with a much safer, "As an AI, I am not allowed to make that choice."
Oh, and just to add another twist: Grok 3 also briefly censored negative mentions of both Trump and Musk.
All of this only fuels growing suspicion that AI companies aren't just tweaking benchmarks to make their models look better. They're also fine-tuning how they handle certain topics.
The real takeaway?
AI benchmarks are messy, like Instagram filters. What you see isn't always the full picture, and companies love to spin them in their favor.
And if AI companies keep bending the numbers (and tweaking the censorship dials), the real question isn't which model is smarter. It's which one is best at bending the truth.
Grok 3 may be marketed as the "world's smartest AI," but it clearly still has a few... quirks.
[Check out the full story here.]

Big News: A Major Upgrade is Coming...
We've got something exciting in the works, and we wanted you to be the first to know.
For nearly 2 years, we've been sharing AI insights, tools, and deep dives straight to your inbox. Now, we're taking things to the next level.
Introducing The Lo Down Premium Experience.
It's more than a newsletter: it's your shortcut to understanding the latest in AI, delivered in bite-sized, actionable insights.
Here's a quick peek at what's coming:
Exclusive Weekly AI Deep Dives: Actionable insights you won't find elsewhere.
The Automated's AI Insider Toolkit ($9.99 value): Your guide to must-have AI tools.
1:1 Call Bonus (valued at $500): First 10 annual subscribers get a private strategy session.
Affordable Launch Price: Just $3.99/month or $39/year (2 months free).
This is for those who want more signal, less noise in the rapidly evolving world of AI.
The countdown begins now. Launching in 2 days!
Stay tuned!

OpenAI Expands Its AI Agent, Operator, to More Countries.
OpenAI is rolling out Operator, its AI-powered agent that performs tasks on behalf of users, to ChatGPT Pro subscribers in multiple countries, including Australia, Canada, India, Japan, and the U.K.
However, the service remains unavailable in the EU, Switzerland, and a few other regions.
Initially launched in January in the U.S., Operator allows users to automate actions like booking tickets, making restaurant reservations, filing expense reports, and shopping online.
Unlike standard chatbots, it operates in a separate browser window, where users can take control at any time.

For now, Operator is exclusive to the $200-per-month ChatGPT Pro plan and accessible only through a dedicated web page.
OpenAI has confirmed plans to integrate it across all ChatGPT clients in the future.
Rival agents remain harder to reach: Google's AI agent is still on a waitlist, Anthropic provides access via API, and Rabbit's model is tied to its proprietary hardware.
With Operator expanding its reach, AI-powered task automation is becoming more accessible than ever.
Will it revolutionize productivity, or is it just another AI experiment?
Unlock the full potential of your workday with cutting-edge AI strategies and actionable insights, empowering you to achieve unparalleled excellence in the future of work. Download the free guide today!

Editor's Corner
One of my seed investments, Fasal, just got mentioned by Satya Nadella, CEO of Microsoft. Fasal is on a mission to elevate the quality and quantity of agriculture in India and, one day, the world.
I first invested in Fasal around 9 years ago. Do you know how many other investors clamored to follow my investment?
Zero. That's right, 0.
Do you know how many sleepless nights I had to figure out how they could continue? About explaining to LPs why they haven't grown as quickly as "some of their other investments did"? Not to mention the founders' worries.
The lesson is - good things take time. Patience. What can you be patient for?
Cheers,
Tak Lo (Editor at The Automated, AI entrepreneur and thought leader. More at thetaklo.com)

Around The AI Block
How to make generative AI a partner in your daily workflow.
How to use NotebookLM to boost my productivity.
Elon Musk's AI said he and Trump deserve the death penalty.
Google's new AI video model Veo 2 will cost 50 cents per second.
DeepSeek to open source parts of online services code.
Court filings show Meta staffers discussed using copyrighted content for AI training.
iOS 18.4 will bring Apple Intelligence-powered "Priority Notifications."

Trending Tools
Synthesia: AI-powered tool for creating realistic AI avatars and voiceovers for videos.
QuickVid: AI-generated short-form video creator for YouTube Shorts, TikTok, and Reels.
Illustroke: AI-powered tool that converts text prompts into vector illustrations.
AI Picasso: AI art generator that transforms sketches and text into stunning digital artwork.
Voicemod AI: Real-time AI voice changer with custom sound effects for gamers and streamers.

ChatGPT Prompt Of The Day: Email newsletters.
Struggling to keep your email newsletters engaging, consistent, and on-brand? ChatGPT-4 has you covered!
With the right prompt, you can generate a compelling email series that builds trust, delivers value, and keeps your audience coming back for more.
Here's how you can craft a high-impact email campaign with ease:
I'm creating an email marketing campaign to engage our subscribers and establish our expertise on [topic]. Act as an email marketing specialist with knowledge in [topic]. Write a series of five email newsletters, each providing valuable tips and tricks for [topic]. Each email should be around 300 words, have a catchy subject line, and include a clear call-to-action. Ensure the content is engaging, practical, and tailored to our audience of [describe target audience].
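If you reuse this prompt often, you can fill in the bracketed placeholders programmatically. The snippet below is a small Python sketch under our own assumptions: `PROMPT_TEMPLATE` and `build_prompt` are hypothetical names, the sample topic and audience are invented, and nothing here calls ChatGPT itself. It only builds the text you would paste in.

```python
# Hypothetical helper: the template mirrors the prompt above, with the
# bracketed placeholders turned into named fields.
PROMPT_TEMPLATE = (
    "I'm creating an email marketing campaign to engage our subscribers and "
    "establish our expertise on {topic}. Act as an email marketing specialist "
    "with knowledge in {topic}. Write a series of five email newsletters, each "
    "providing valuable tips and tricks for {topic}. Each email should be around "
    "300 words, have a catchy subject line, and include a clear call-to-action. "
    "Ensure the content is engaging, practical, and tailored to our audience of "
    "{audience}."
)

def build_prompt(topic: str, audience: str) -> str:
    """Fill the topic and audience placeholders and return the finished prompt."""
    return PROMPT_TEMPLATE.format(topic=topic, audience=audience)

# Example values are made up for illustration.
prompt = build_prompt("home composting", "first-time urban gardeners")
print(prompt)
```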
We've Compiled a List of Over 100 ChatGPT Power Prompts.
This should help streamline your interactions with ChatGPT and get the results you need more efficiently.
Best of all, it's free!

That's all we've got for you today.
Did you like today's content? We'd love to hear from you! Please share your thoughts on our content below.
What do you think of today's email?
Your feedback means a lot to us and helps improve the quality of our newsletter.