Bloomberg launches BloombergGPT

April 5, 2023

Bloomberg has created its own large-scale generative artificial intelligence (AI) model, BloombergGPT.

This model is a large language model (LLM) designed to “know” everything the entire company “knows.” BloombergGPT is specifically trained on a wide range of financial data, including the largest domain-specific dataset yet constructed, consisting of more than 363 billion tokens of Bloomberg’s financial data, news, filings, press releases, web-scraped financial documents, and social media.

The company has also trained BloombergGPT on another 345 billion tokens from general-purpose datasets obtained from elsewhere, including hundreds of English news sources, excluding those written by Bloomberg journalists, to maintain factuality and reduce bias. The Pile, which includes everything from YouTube captions to Project Gutenberg and a complete copy of Wikipedia, has also been included.

BloombergGPT will assist in improving existing financial NLP tasks, such as sentiment analysis, named entity recognition, news classification, and question answering. It is also able to translate natural language requests into the Bloomberg Query Language, a task which is tightly connected to Bloomberg’s needs. Additionally, the model can suggest Bloomberg-style headlines for news stories.

The new model is the first step in the development and application of this new technology for the financial industry, bringing the full potential of AI to the financial domain. This will unlock new opportunities for marshalling the vast quantities of data available on the Bloomberg Terminal to better help the firm’s customers.

BloombergGPT is trained on a corpus of more than 700 billion tokens, larger than OpenAI’s GPT-3 that was trained on about 500 billion tokens in 2020. The company’s research paper details the development of BloombergGPT and is written by Bloomberg’s Shijie Wu, Ozan İrsoy, Steven Lu, Vadim Dabravolski, Mark Dredze, Sebastian Gehrmann, Prabhanjan Kambadur, David Rosenberg, and Gideon Mann.

The sources for this piece include an article in NiemanLab.

Top Stories

Related Articles

December 23, 2025 Editor's Notes: This is the first of two articles reflecting on the year but Yogi Schulz. Schulz' more...

December 23, 2025 Google parent company Alphabet said Monday that it will acquire Intersect Power for $4.75 billion in cash more...

December 22, 2025 Artificial intelligence dominated global search behaviour in 2025, with Google’s own AI assistant, Gemini, emerging as the more...

December 22, 2025 OpenAI has hired the former head of Shopify’s core product organization to lead its next phase of more...

Jim Love

Jim is an author and podcast host with over 40 years in technology.

Share:
Facebook
Twitter
LinkedIn