For the last six years (starting in 2018), AI investors Nathan Benaich and Air Street Capital have consistently published the State of AI Report. This article summarizes key takeaways from the 2024 report published here, along with the author’s interpretation of model cards, industry reports, research reports, and more available publicly. #1 – Frontier lab performance converges, but OpenAI maintains its edge following the launch of o1 (aka Strawberry) When…
Building Gen AI Apps with Snowflake Cortex as a Foundational AI Platform
An illustrative blog with a reference application applying Snowflake Cortex
With Generative AI in mainstream adoption, Snowflake has previously shared its Generative AI vision to bring Gen AI and LLMs closer to customers’ data. Snowflake’s Cortex AI is its fully managed service (GA on May 24) to manage LLMs and the entire lifecycle for diverse business and technical users: This article shares the key architecture and design approaches and related components…
Top Ten Technology Trends for 2024
Observing technology trends by analysts, research companies, and thought leaders provides a broader perspective. It not only helps software architects to understand the impact of technologies being adopted but also helps them build the right skill set for themselves or their team. This article synthesizes the top ten technology trends for 2024 and beyond based on my broader research, technology insights, observations, and industry experience. #1 — Generative AI and AI Platforms Generative AI appears as…
Top 5 Open Source LLMs for Building Gen AI Enterprise Applications
Introduction
Large Language Models (LLMs) need no introduction given their massive adoption in the industry. Most enterprises have either adopted or are planning to adopt an LLM to build Generative AI-based enterprise applications supporting a variety of business use cases. While there are many closed-source options available, such as OpenAI’s GPT-3.5 or GPT-4 and Google’s Gemini, open-source LLMs have started gaining traction because of…
Key Takeaways from AWS re:Invent 2023 for Software Architects
AWS re:Invent 2023 continued the tradition of being the most happening cloud computing technology event of the year. While there were many sessions covering leadership, partnership, technology updates, case studies, and much more, this article focuses on sharing key takeaways from the event for software architects. Read about 2022 re:Invent takeaways by clicking here. #1 – The frugal architect with cost awareness and sustainability mindset Werner’s keynote has always…
Elevate Your AI Journey with Amazon Bedrock: Unraveling The Key Features
Amazon Bedrock, which was announced in April 2023, has drawn a lot of attention from businesses looking to leverage their existing AWS architecture for building Generative AI applications. Amazon announced the General Availability (GA) of Bedrock on September 28 — a service that offers a choice of Generative AI models from Amazon and third-party providers through an API-based interface. With the current market landscape (high demand for leveraging GAI for building innovative business…
Meta launches Code Llama expanding Llama (AI Model) Capabilities
Meta released Code Llama, a large language model (LLM) that can use text prompts to generate and discuss code, on August 24, 2023. It is built on Llama 2 as its foundational model and is free for research and commercial use. Click here to read the news announcement published by Meta. The visualization below depicts the foundational building block of Llama 2, and an approach to build your own…
Microsoft’s Partner Ecosystem: A Promising Outlook and Strong Belief with Generative AI and Cloud
Microsoft Inspire 2023, a conference aimed at expanding the partner ecosystem, just came to a close. Key announcements are summarized below: Microsoft is expanding its partnership with Meta by launching Llama 2 on Azure and Windows. Microsoft launched Bing Chat Enterprise and Microsoft 365 Copilot for enterprise customers; the concept of Copilot is not limited to GitHub, and the foundational building block has been extended to other products. Bing Chat Enterprise enables organizations…
Stack Overflow announced Generative AI Capability to Boost Developer’s Productivity
Stack Overflow is the largest question-and-answer website for developers and technologists and has served the community for 10+ years. The statistics below indicate the popularity of the platform in the developer community: Their CEO, Prashanth Chandrasekar, recently shared the roadmap for the integration of Generative AI into the public platform (Stack Overflow for Teams). Launched as Overflow AI, the suite of capabilities will provide the following key benefits to the…
Top Ten Technology Trends for 2023
Observing technology trends by analysts, research companies, and thought leaders provides a broader perspective. It not only helps software architects to understand the impact of technologies being adopted but also helps them build the right skill set for themselves or their team. This article synthesizes the top ten technology trends for 2023 and beyond based on broader research. It is a continuation of our previous article for 2022: click here…