Startup Dreamers
Innovation

Salesforce Introduces XGen-7B, A Large Language Model With Longer Context Support

By admin | July 3, 2023

The race to release open source generative AI models is heating up. Salesforce has joined in by launching XGen-7B, a large language model that supports a longer context window than many of the available open source LLMs.

The 7B in XGen-7B stands for 7 billion parameters. The more parameters, the bigger the model. Models with more parameters, such as 13 billion, require more powerful CPUs, GPUs, RAM, and storage. In return, a larger model generally produces more accurate responses, since it has more capacity to learn from large data corpora. So, it’s a tradeoff between size and accuracy.
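To make the hardware side of that tradeoff concrete, here is a minimal back-of-the-envelope sketch of how much memory is needed just to hold a model's weights. The function name `weight_memory_gb` is hypothetical, and the estimate assumes weights only (no activations, cache, or training state), so real requirements will be higher.

```python
# Rough memory needed just to store model weights.
# Assumption: weights only; activations, caches, and optimizer state are ignored.
# Bytes per parameter are the standard sizes of each numeric format.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(num_params: int, dtype: str = "fp16") -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for name, params in [("7B", 7_000_000_000), ("13B", 13_000_000_000)]:
        print(f"{name}: about {weight_memory_gb(params, 'fp16')} GB in fp16")
```

Under these assumptions, a 7 billion parameter model needs about 14 GB for its weights in 16-bit precision, while a 13 billion parameter model needs about 26 GB.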

One of the key differentiators of XGen-7B is its 8K context window. A larger context window allows a longer prompt and a longer output from the model. This means it’s possible to send prompts with additional context to the model and get longer responses. The 8K figure, measured in tokens, is the cumulative size of the input and output text.
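Because the window covers input and output together, every token spent on the prompt is a token the response cannot use. The sketch below illustrates that budget. It assumes "8K" means 8,192 tokens, and the helper name `max_output_tokens` is hypothetical rather than part of any Salesforce API.

```python
# Assumption: "8K" is taken to mean 8 * 1024 = 8192 tokens.
CONTEXT_WINDOW = 8192


def max_output_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for the response after the prompt is counted."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # If the prompt alone fills or exceeds the window, nothing is left for output.
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    print(max_output_tokens(6000))  # a long prompt leaves a smaller response budget
    print(max_output_tokens(500))   # a short prompt leaves most of the window free
```

In practice, a prompt that exceeds the window has to be shortened or split before it is sent to the model.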

Let’s understand what a token is. Since machine learning models operate on numbers and not characters, each word or part of a word is converted into a token, which is represented by a number. Tokenization is a way to encode text, loosely comparable to ASCII or Unicode. To turn words into tokens, XGen-7B uses OpenAI’s tokenizing system, the one used with OpenAI’s popular models such as GPT-3 and GPT-4.
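The idea of turning text into numbers can be illustrated with a toy example. This is a minimal sketch and not the tokenizer XGen-7B actually uses: the vocabulary `TOY_VOCAB` and the greedy longest-match rule are assumptions made purely for illustration, and real tokenizers use vocabularies with many thousands of entries.

```python
# Toy vocabulary mapping text pieces to integer IDs (hypothetical, for illustration).
TOY_VOCAB = {"sales": 0, "force": 1, "token": 2, "s": 3, " ": 4, "<unk>": 5}
ID_TO_PIECE = {idx: piece for piece, idx in TOY_VOCAB.items()}


def encode(text: str, vocab: dict = TOY_VOCAB) -> list:
    """Split text into the longest pieces found in vocab and return their IDs."""
    ids = []
    longest = max(len(piece) for piece in vocab)
    i = 0
    while i < len(text):
        # Try the longest candidate piece first, then shrink.
        for size in range(min(longest, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in vocab:
                ids.append(vocab[piece])
                i += size
                break
        else:
            # No known piece starts here: emit the unknown token, skip one character.
            ids.append(vocab["<unk>"])
            i += 1
    return ids


def decode(ids: list) -> str:
    """Map IDs back to their text pieces and join them."""
    return "".join(ID_TO_PIECE[i] for i in ids)


if __name__ == "__main__":
    ids = encode("salesforce tokens")
    print(ids)          # [0, 1, 4, 2, 3]
    print(decode(ids))  # salesforce tokens
```

Note that "salesforce" becomes two tokens and "tokens" becomes two tokens, which is why token counts usually differ from word counts.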

XGen-7B is an alternative to open source LLMs such as MPT, Falcon, and LLaMA. Salesforce claims that its LLM achieves comparable or better results than the current state-of-the-art language models of similar size.

Salesforce has released three variants of XGen-7B. The first one, XGen-7B-4K-base, supports a 4K context window, while the second variant, XGen-7B-8K-base, is trained on additional data to support an 8K context length. Both of these variants are released under the Apache 2.0 open source license, which allows commercial usage.

The third variant, XGen-7B-{4K,8K}-inst, is fine-tuned on instructional data including databricks-dolly-15k, oasst1, Baize and GPT-related datasets, and is available only for research purposes. The inst suffix in the name indicates that the model has been tuned to follow instructions. According to Salesforce, this was done through supervised fine-tuning on the instruction datasets rather than through reinforcement learning from human feedback (RLHF). An instruction-tuned language model can be used to build chatbots similar to ChatGPT.

Salesforce used multiple datasets, such as RedPajama, Wikipedia, and the Starcoder code dataset from the BigCode project, to train the XGen-7B LLM. Based on Google Cloud pricing for TPU-v4, Salesforce estimates the training cost of the model at $150K for 1T tokens. The training data covers 22 different languages, which makes the model multilingual.

Salesforce evaluated XGen-7B on Massive Multitask Language Understanding (MMLU), a benchmark that measures the ability to answer multiple-choice questions from various branches of knowledge such as the humanities, STEM, social sciences, and other domains. According to Salesforce, XGen-7B scores better on this benchmark than other models of similar size.
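A multiple-choice benchmark of this kind is typically scored as plain accuracy: the share of questions where the model's chosen option matches the answer key. The sketch below shows that calculation. The function name and the sample answers are hypothetical and are not taken from Salesforce's evaluation.

```python
def multiple_choice_accuracy(predictions: list, answers: list) -> float:
    """Return the fraction of questions where the predicted option matches the key."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one question is required")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


if __name__ == "__main__":
    # Made-up data for illustration only: four questions with options A to D.
    model_choices = ["A", "C", "B", "D"]
    answer_key = ["A", "B", "B", "D"]
    print(multiple_choice_accuracy(model_choices, answer_key))  # 0.75
```

Benchmark reports usually average this accuracy across subjects, so a single headline number can hide strong and weak areas.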

The XGen-7B model also does well in other categories, such as conversations, long-form Q&A and summarization.

Salesforce also added a disclaimer stating that its LLM is subject to the same limitations as other LLMs, such as bias, toxicity, and hallucinations.

With a larger context window and a comprehensive set of datasets used for training, the XGen-7B LLM from Salesforce looks promising.
