Research Note: IBM Granite 3.0 Models

IBM recently released Granite 3.0, its third generation of LLMs, designed to balance performance with safety, speed, and cost-efficiency for enterprise use. Its flagship model, Granite 3.0 8B Instruct, is a dense, instruction-tuned LLM optimized for enterprise tasks, trained on 12 trillion tokens across multiple languages and programming languages.
Let OpenAI GPT-4o Mini Introduce Itself

We usually write our own content, using LLMs to help refine our research, but in honor of OpenAI releasing its new GPT-4o Mini language model just a day after Meta released its Llama 3.1, and on the same day Mistral releases its new Mistral Large 2, we’ll let GPT 4o Mini tell you about itself.
Research Note: Mistral NeMo 12B Small Language Model

Mistral AI and NVIDIA launched Mistral NeMo 12B, a state-of-the-art language model for enterprise applications such as chatbots, multilingual tasks, coding, and summarization. The collaboration combines Mistral AI’s training data expertise with NVIDIA’s optimized hardware and software ecosystem, offering high performance across diverse applications.
Research Note: Databricks DBRX LLM

Databricks launched DBRX, a new open, general-purpose Large Language Model (LLM) that sets a new benchmark for performance and efficiency.
DBRX surpasses the capabilities of existing models like GPT-3.5 while also demonstrating competitive performance with closed models such as Gemini 1.0 Pro, making it a formidable player in general-purpose applications and specialized coding tasks.
HPE Brings AI-as-a-Service to the Enterprise with Its GreenLake for LLMs

Hewlett Packard Enterprise has always taken a decidedly infrastructure-centric approach to delivering flexible consumption-based compute, storage, and networking to enterprise IT. HPE provides that flexibility through its popular HPE GreenLake offerings. Last week at its annual HPE Discover Event in Las Vegas, HPE introduced the first of what the company says will be many domain-specific […]
MongoDB Embraces AI & Reduces Developer Friction With New Features

Servicing the quickly evolving needs of modern application development requires rapid innovation and fast product cycles. MongoDB demonstrated both last week at its MongoDB.local 2023 event in New York City, introducing a broad set of new features and services. The announcements cover a wide breadth of territory, with new capabilities to leverage the latest AI […]
Elastic’s Elasticsearch Relevance Engine Enables Generative AI Search
Background The challenge for an enterprise wanting to harness the power of large language models (LLMs) is that a language model is only as capable as the data it’s trained on and understands. This hampers the ability to leverage the technology to solve real-world business problems. LLMs become infinitely more powerful when deeply integrated with […]