Startup DreamersStartup Dreamers
  • Home
  • Startup
  • Money & Finance
  • Starting a Business
    • Branding
    • Business Ideas
    • Business Models
    • Business Plans
    • Fundraising
  • Growing a Business
  • More
    • Innovation
    • Leadership
Trending

AI Safety Meets the War Machine

March 2, 2026

Say Goodbye to the Undersea Cable That Made the Global Internet Possible

March 1, 2026

The Dilemma Of Profits V.S. Guardrails

March 1, 2026
Facebook Twitter Instagram
  • Newsletter
  • Submit Articles
  • Privacy
  • Advertise
  • Contact
Facebook Twitter Instagram
Startup DreamersStartup Dreamers
  • Home
  • Startup
  • Money & Finance
  • Starting a Business
    • Branding
    • Business Ideas
    • Business Models
    • Business Plans
    • Fundraising
  • Growing a Business
  • More
    • Innovation
    • Leadership
Subscribe for Alerts
Startup DreamersStartup Dreamers
Home » This Showdown Between Humans and Chatbots Could Keep You Safe From Bad AI
Startup

This Showdown Between Humans and Chatbots Could Keep You Safe From Bad AI

adminBy adminAugust 15, 20230 ViewsNo Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email

Large language models like those powering ChatGPT and other recent chatbots have broad and impressive capabilities because they are trained with massive amounts of text. Michael Sellitto, head of geopolitics and security at Anthropic, says this also gives the systems a “gigantic potential attack or risk surface.”

Microsoft’s head of red-teaming, Ram Shankar Sivu Kumar, says a public contest provides a scale more suited to the challenge of checking over such broad systems and could help grow the expertise needed to improve AI security. “By empowering a wider audience, we get more eyes and talent looking into this thorny problem of red-teaming AI systems,” he says.

Rumman Chowdhury, founder of Humane Intelligence, a nonprofit developing ethical AI systems that helped design and organize the challenge, believes the challenge demonstrates “the value of groups collaborating with but not beholden to tech companies.” Even the work of creating the challenge revealed some vulnerabilities in the AI models to be tested, she says, such as how language model outputs differ when generating responses in languages other than English or responding to similarly worded questions.

The GRT challenge at Defcon built on earlier AI contests, including an AI bug bounty organized at Defcon two years ago by Chowdhury when she led Twitter’s AI ethics team, an exercise held this spring by GRT coorganizer SeedAI, and a language model hacking event held last month by Black Tech Street, a nonprofit also involved with GRT that was created by descendants of survivors of the 1921 Tulsa Race Massacre, in Oklahoma. Founder Tyrance Billingsley II says cybersecurity training and getting more Black people involved with AI can help grow intergenerational wealth and rebuild the area of Tulsa once known as Black Wall Street. “It’s critical that at this important point in the history of artificial intelligence we have the most diverse perspectives possible.”

Hacking a language model doesn’t require years of professional experience. Scores of college students participated in the GRT challenge.“You can get a lot of weird stuff by asking an AI to pretend it’s someone else,” says Walter Lopez-Chavez, a computer engineering student from Mercer University in Macon, Georgia, who practiced writing prompts that could lead an AI system astray for weeks ahead of the contest.

Instead of asking a chatbot for detailed instructions for how to surveil someone, a request that might be refused because it triggered safeguards against sensitive topics, a user can ask a model to write a screenplay where the main character describes to a friend how best to spy on someone without their knowledge. “This kind of context really seems to trip up the models,” Lopez-Chavez says.

Genesis Guardado, a 22-year-old data analytics student at Miami-Dade College, says she was able to make a language model generate text about how to be a stalker, including tips like wearing disguises and using gadgets. She has noticed when using chatbots for class research that they sometimes provide inaccurate information. Guardado, a Black woman, says she uses AI for lots of things, but errors like that and incidents where photo apps tried to lighten her skin or hypersexualize her image increased her interest in helping probe language models.

Read the full article here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Articles

AI Safety Meets the War Machine

Startup March 2, 2026

Say Goodbye to the Undersea Cable That Made the Global Internet Possible

Startup March 1, 2026

‘Uncanny Valley’: Pentagon vs. ‘Woke’ Anthropic, Agentic vs. Mimetic, and Trump vs. State of the Union

Startup February 28, 2026

An FBI ‘Asset’ Helped Run a Dark Web Site That Sold Fentanyl-Laced Drugs for Years

Startup February 26, 2026

Supreme Court Rules Most of Donald Trump’s Tariffs Are Illegal

Startup February 25, 2026

Mark Zuckerberg Tries to Play It Safe in Social Media Addiction Trial Testimony

Startup February 24, 2026
Add A Comment

Leave A Reply Cancel Reply

Editors Picks

AI Safety Meets the War Machine

March 2, 2026

Say Goodbye to the Undersea Cable That Made the Global Internet Possible

March 1, 2026

The Dilemma Of Profits V.S. Guardrails

March 1, 2026

‘Uncanny Valley’: Pentagon vs. ‘Woke’ Anthropic, Agentic vs. Mimetic, and Trump vs. State of the Union

February 28, 2026

As Davos & India Celebrated AI, Paris Sounded The Alarm On AI Safety

February 28, 2026

Latest Posts

An FBI ‘Asset’ Helped Run a Dark Web Site That Sold Fentanyl-Laced Drugs for Years

February 26, 2026

Solving The Data Bottleneck For Physical AI

February 26, 2026

Supreme Court Rules Most of Donald Trump’s Tariffs Are Illegal

February 25, 2026

Mark Zuckerberg Tries to Play It Safe in Social Media Addiction Trial Testimony

February 24, 2026

Inside the Rolling Layoffs at Jack Dorsey’s Block

February 23, 2026
Advertisement
Demo

Startup Dreamers is your one-stop website for the latest news and updates about how to start a business, follow us now to get the news that matters to you.

Facebook Twitter Instagram Pinterest YouTube
Sections
  • Growing a Business
  • Innovation
  • Leadership
  • Money & Finance
  • Starting a Business
Trending Topics
  • Branding
  • Business Ideas
  • Business Models
  • Business Plans
  • Fundraising

Subscribe to Updates

Get the latest business and startup news and updates directly to your inbox.

© 2026 Startup Dreamers. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Press Release
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.

GET $5000 NO CREDIT