This weekend, two things caught my attention: a lecture by Yann LeCun, Professor at NYU and Chief AI Scientist at Meta, titled ‘Objective-Driven AI: Towards AI systems that can learn, remember, reason, plan, have common sense, yet are steerable and safe’, and four design patterns for agents from Andrew Ng.
The holiday period has started, and there will be no newsletter next week. An advance Eid Mubarak to whoever is celebrating.
The first slide of the 97-slide deck opens with the title ‘Machine Learning sucks! (compared to humans and animals)’. I encourage every one of you to spare an hour and go through the complete deck; it is a master class on Artificial Intelligence.
Slide from Objective-Driven AI – Yann LeCun
The second good read is on the four design patterns for AI agent workflows introduced by Andrew Ng:
Reflection: The LLM examines its own work to come up with ways to improve it.
Tool use: The LLM is given tools such as web search, code execution, or any other function to help it gather information, take action, or process data.
Planning: The LLM comes up with, and executes, a multistep plan to achieve a goal (for example, writing an outline for an essay, then doing online research, then writing a draft, and so on).
Multi-agent collaboration: More than one AI agent work together, splitting up tasks and discussing and debating ideas, to come up with better solutions than a single agent would.
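To make the first pattern concrete, here is a minimal sketch of a Reflection loop in Python. It is my own illustration, not code from Andrew Ng's write-up: the function names (`reflect_and_revise`, `toy_generate`, `toy_critique`, `toy_revise`) are hypothetical, and the three toy functions are simple stand-ins for what would be LLM calls in a real workflow.

```python
from typing import Callable, Optional


def reflect_and_revise(
    generate: Callable[[str], str],
    critique: Callable[[str, str], Optional[str]],
    revise: Callable[[str, str, str], str],
    task: str,
    max_rounds: int = 3,
) -> str:
    """Draft once, then alternate critique and revision until the critic is satisfied."""
    draft = generate(task)
    for _ in range(max_rounds):
        feedback = critique(task, draft)
        if feedback is None:  # no remaining complaints, so stop early
            break
        draft = revise(task, draft, feedback)
    return draft


# Hypothetical stand-ins for LLM calls (assumptions for illustration only).
def toy_generate(task: str) -> str:
    return "agents are useful"


def toy_critique(task: str, draft: str) -> Optional[str]:
    if not draft[0].isupper():
        return "Capitalize the first letter."
    if not draft.endswith("."):
        return "End with a period."
    return None


def toy_revise(task: str, draft: str, feedback: str) -> str:
    if "Capitalize" in feedback:
        return draft[0].upper() + draft[1:]
    if "period" in feedback:
        return draft + "."
    return draft


result = reflect_and_revise(
    toy_generate, toy_critique, toy_revise, "Write one sentence about agents."
)
print(result)
```

In a real workflow, each of the three callables would wrap a prompt to a model, and `max_rounds` caps the cost if the critic never stops finding issues.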
This week’s AI breakthroughs mark another leap forward in the tech revolution.
Jamba: AI21’s SSM-Transformer model enhances the Mamba Structured State Space model (SSM) technology with elements of the traditional Transformer architecture, compensating for the inherent limitations of a pure SSM model. It offers a 256K context window.
Grok-1.5 has been announced by xAI, with improved reasoning capabilities and a context length of 128,000 tokens.
Third-party testing as a key ingredient of AI policy: In this article, Anthropic explores the concept of third-party testing, including its importance and the research that led to advocating for this policy stance. Additionally, they examine how testing intertwines with broader AI policy issues, like the availability of open models and concerns about regulatory capture.
Lumiere is a space-time diffusion research model from Google Research. Using fine-tuned text-to-image model weights, Lumiere can generate videos in the target style from a single reference image.
TensorRT-LLM running on NVIDIA H200 Tensor Core GPUs, the latest memory-enhanced Hopper GPUs, has shown the fastest performance running inference in MLPerf’s biggest test of generative AI. The benchmark ran on the largest version of Llama 2, with 70 billion parameters, a model more than 10 times larger than the GPT-J LLM used in earlier benchmarks.
Things to Know
‘You Transformed the World,’ NVIDIA CEO Tells Researchers Behind Landmark AI Paper: a look at the people behind the research paper “Attention Is All You Need.”
The Opportunity…
Podcast:
This week’s Open Tech Talks episode 126 is “Meet AI Teacher: The Future of AI in Education Unveiled with Dr Pauldy Otermans and Dev Aditya”
GitHub Foundation Certification Series: An introductory session to explore the basics of GitHub as well as foundational concepts on Git with hands-on examples on how to use both
Introduction to Data-Centric AI: This class covers algorithms to find and fix common issues in ML data and to construct better datasets, concentrating on data used in supervised learning tasks like classification
Gradio: Build Machine Learning Web Apps in Python. It’s an open-source Python package that allows you to build a demo or web application for your machine-learning model
Fuel provides your machine learning models with the data they need to learn. It offers interfaces to common datasets such as MNIST, CIFAR-10 (image datasets), Google’s One Billion Words (text), and many more
Hit reply and let me know what you found most helpful this week – I’d love to hear from you!
Until next week,
Kashif Manzoor
The opinions expressed here are solely my conjecture based on experience, practice, and observation. They do not represent the thoughts, intentions, plans, or strategies of my current or previous employers or their clients/customers. The objective of this newsletter is to share and learn with the community.