Microsoft unveils Phi-2, a small language model that packs power

Microsoft

If you consider language fashions in relation to generative artificial intelligence (AI), the primary time period that in all probability involves thoughts is giant language mannequin (LLM). These LLMs energy hottest chatbots, similar to ChatGPT, Bard, and Copilot. Nevertheless, Microsoft’s new language mannequin is right here to indicate that small language fashions (SLMs) have nice promise within the generative AI house, too.

On Wednesday, Microsoft released Phi-2, a small language mannequin able to common sense reasoning and language understanding, and it is now out there within the Azure AI Studio mannequin catalog.

Additionally: AI in 2023: A year of breakthroughs that left no human thing unchanged

Do not let the phrase small idiot you, although. Phi-2 packs 2.7 billion parameters in its mannequin, which is a giant bounce from Phi-1.5, which had 1.3 billion parameters.

Regardless of its compactness, Phi-2 showcased “state-of-the-art efficiency” amongst language fashions with lower than 13 billion parameters, and it even outperformed fashions as much as 25 instances bigger on advanced benchmarks, in keeping with Microsoft.

Additionally: Two breakthroughs made 2023 tech’s most innovative year in over a decade

Phi-2 outperformed fashions — together with Meta’s Llama-2, Mistral, and even Google’s Gemini Nano 2, which is the smallest model of Google’s most succesful LLM, Gemini — on a number of completely different benchmarks, as seen beneath.

Phi-2 performance on benchmarks — Microsoft

Phi-2’s efficiency outcomes are congruent with Microsoft’s aim with Phi of creating an SLM with emergent capabilities and efficiency akin to fashions on a a lot bigger scale.

Additionally: ChatGPT vs. Bing Chat vs. Google Bard: Which is the best AI chatbot?

“A query stays whether or not such emergent talents could be achieved at a smaller scale utilizing strategic decisions for coaching, e.g., knowledge choice,” stated Microsoft.

“Our line of labor with the Phi fashions goals to reply this query by coaching SLMs that obtain efficiency on par with fashions of a lot bigger scale (but nonetheless removed from the frontier fashions).”

When coaching Phi-2, Microsoft was very selective concerning the knowledge used. The corporate first used what it calls “text-book high quality” knowledge. Microsoft then augmented the language mannequin database by including rigorously chosen net knowledge, which was filtered on instructional worth and content material high quality.

So, why is Microsoft centered on SLMs?

Additionally: These 5 major tech advances of 2023 were the biggest game-changers

SLMs are a cheap various to LLMs. Smaller fashions are additionally helpful when they’re getting used for for a job that is not demanding sufficient to require the facility of an LLM.

Moreover, the computational energy required to run SLMs is way lower than LLMs. This decreased requirement means customers do not essentially must spend money on costly GPUs to energy their data-processing necessities.

Source link

Microsoft unveils Phi-2, a small language model that packs power

I switched to the iPhone 16 from an iPhone 15, and the upgrade was bigger than expected

Google Photos now has a subtle new but much needed feature

The best early October Prime Day 2024 deals to shop now

FTC report exposes massive data collection by social media brands – how to protect yourself

Learn a new language with 74% off a Babbel subscription

How to check in with friends or family from your Apple Watch

Leave A Reply Cancel Reply

Newsom Threatens Elon Musk with Legal Action For Sharing Kamala Harris AI Meme (VIDEO) | The Gateway Pundit

Israel says 100 Hezbollah rocket launchers hit in southern Lebanon

Here’s what to expect from Dale Earnhardt Jr.’s return to Bristol

LIKE A BOSS: Conservative Republican Rep. Harriet Hageman Expertly Trolls Code Pinko Protesters in DC (VIDEO) | The Gateway Pundit

Editors Picks

Newsom Threatens Elon Musk with Legal Action For Sharing Kamala Harris AI Meme (VIDEO) | The Gateway Pundit

Israel says 100 Hezbollah rocket launchers hit in southern Lebanon

Here’s what to expect from Dale Earnhardt Jr.’s return to Bristol

LIKE A BOSS: Conservative Republican Rep. Harriet Hageman Expertly Trolls Code Pinko Protesters in DC (VIDEO) | The Gateway Pundit

As Sri Lanka votes, a $2.9bn IMF loan looms large | Elections

Microsoft unveils Phi-2, a small language model that packs power

Keep Reading

Leave A Reply Cancel Reply