April 29, 2024
Why in News?
Microsoft claims that its latest small language models have outperformed several AI models of the same size, as well as larger ones. It also said that India’s ITC has used the new Phi-3-mini.
A few days after Meta unveiled its Llama 3 Large Language Model (LLM), Microsoft on Tuesday (April 23) unveiled the latest version of its ‘lightweight’ AI model, the Phi-3-mini. Microsoft has described Phi-3 as a family of open AI models that are the most capable and cost-effective small language models (SLMs) available.
What is Phi-3-mini?
Phi-3-mini is believed to be the first of three small models that Microsoft plans to release. It has reportedly outperformed models of the same size and the next size up across a variety of benchmarks, in areas such as language, reasoning, coding, and maths.
What are LLMs & SLMs?
The ‘Large’ in LLMs refers to two things: the enormous size of the training data and the parameter count. In machine learning, where systems learn patterns from data rather than being explicitly instructed, parameters are the values a model learns during training. They hold what the model has learned and determine how well it can solve a specific problem.
A large language model (LLM) is a type of artificial intelligence (AI) program that can recognize and generate text, among other tasks. LLMs are trained on huge sets of data — hence the name “large.” LLMs are built on machine learning: specifically, a type of neural network called a transformer model.
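To make the idea of a parameter count concrete, here is a minimal sketch that counts the learned values in a toy fully connected neural network. The function name `count_parameters` and the layer sizes are assumptions chosen for illustration; this is not the transformer architecture used by Phi-3 or by any real LLM, only a way of showing what is being counted.

```python
def count_parameters(layer_sizes):
    """Count weights and biases in a toy fully connected network.

    layer_sizes lists the number of units per layer, input first.
    """
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out  # one weight per input-output connection
        total += n_out         # one bias per output unit
    return total


# Hypothetical toy model: 4 inputs, 8 hidden units, 2 outputs.
toy_model = [4, 8, 2]
print(count_parameters(toy_model))  # (4*8 + 8) + (8*2 + 2) = 58
```

A real language model is described by the same kind of count, only far larger, and a "small" model is one where that count is lower.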