Welcome to your weekly AI Newsletter from AITechCircle!
I’m building and implementing AI solutions and sharing everything I learn along the way…
Check out the updates from this week! Please take a moment to share them with a friend or colleague who might benefit from these valuable insights!
Today at a Glance:
Getting started with the OCI Gen AI RAG Agent
Generative AI Use cases repository
AI Weekly news and updates covering newly released LLMs
Courses and events to attend
Creating your first Generative AI RAG Agent
As Agentic AI is the mainstream focus for 2025, if you are new to this topic, you can read an earlier post,’From Perception AI to Generative to Agentic to Physical AI, ‘ which covers the fundamentals and the different stages of AI. This will enable you to understand the required concepts. Today, I will cover the overview of OCI Generative AI Agent Service, a managed service.
It combines LLMs with an intelligent retrieval system to create a contextually aware knowledge base for users’ queries and answers.
Three Types of Knowledge bases supported:
OCI Object storage files of pdf and text types as part of the managed service
Oracle Database 23ai vector search – bring your own option
OCI Search with OpenSearch ingested and indexed data – bring your own option
A retrieval-augmented generation (RAG) agent integrates retrieval capabilities with language generation to answer queries effectively. This AI combines factual precision from specific data sources with language models’ adaptability to craft contextually accurate and detailed responses. RAG agents are beneficial across various fields:
Customer Support: Speeding up responses by pulling data from company databases.
Legal Research: Assisting legal professionals by sourcing relevant case laws and precedents.
Financial Analysis: Analyze extensive financial data and market reports.
Educational Tutoring: Acting as tutors by providing educational content and resources.
Technical Support: Offering guided troubleshooting assistance from technical documents.
Supply Chain Management: Enhancing operational decisions by analyzing logistics data.
Content Creation: Supporting content creators with data and draft suggestions.
Content Moderation:
Content Moderation in OCI Generative AI Agents is a feature that helps identify and block inappropriate content like toxicity, violence, and abuse in interactions with large language models. This tool classifies harmful content into four categories:
Hate and Harassment: Includes identity attacks, insults, threats of violence, and sexual aggression.
Self-Inflicted Harm: Covers self-harm and the promotion of eating disorders.
Ideological Harm: Relates to extremism, terrorism, and organized crime.
Exploitation: Encompasses scams and sexual abuse.
Let’s start exploring:
Open the navigation menu after logging into Oracle Cloud and select Analytics & AI. Under AI Services, select Generative AI Agents.
You will see the screen below with the three steps mentioned:
As a first step, you need to create the knowledge base, and for that, you need to have a bucked on the storage:
You should have a knowledge repository or documents in PDF or text format, which you will upload to this bucket. For example, I have created the ‘Organization HR booklet’ and ‘HR Leave policy.’
Now go back to Step 1 of creating the Knowledge Base and create it as per the screenshot below.
After setting the ‘Knowledge Base, ‘ go to Step 2:Create Agent.
Now, let’s interact with the Agent to test it and open the Chat from the left side of the menu.
You can see the results with the proper citation from where the agent is getting. Now, you can expose this agent via endpoint into your Digital Assistant/Chatbot or any GUI you already have.
Source: Blog: GenAI RAG using OCI Digital Assistant and OCI Generative AI Agents
Data Handling in Generative AI Agent
In the OCI Generative AI Agents service, customer queries and related data are only used during a session and not stored afterward. This ensures that personal chat or knowledge base data is not used to train the model or enhance the service. The data from each session is handled privately without being retained or used for any purposes beyond immediate interaction.
Cost Estimates of Generative AI Agent:
Now, the key question is how to calculate the cost of this agent if you want to use it in your AI ecosystem. To answer this, head over to the Oracle Cost Estimator.
In the above example, I have chosen the following configurations:
Request per month: 1000
Average Number of total characters processed per request: 1,500
Storage Capacity: 5GB
Data Ingestion Jobs per month: 10
Average Number of characters processed per data ingestion job: 50,000
The exported cost estimates are in Excel format below; you can download them to examine them.
Try it out yourself, log in to your Oracle Tenancy, and have fun learning how easy it is to get ready with your first Generative AI Agent.
Weekly News & Updates…
Last week’s AI breakthroughs marked another leap forward in the tech revolution.
The Anthropic Economic Index has been released to understand AI’s effects on labor markets and the economy over time. link
Open AI roadmap for ChatGPT 4.5 and 5, few insights from Sam Altman
The Cloud: the backbone of the AI revolution
Agentic AI: A Game Changer for the Construction Industry, link
How Scaling Laws Drive Smarter, More Powerful AI link
Generative AI Use Case of the Week:
Several Generative AI use cases are documented, and you can access the library of Gen AI Use cases. Link
Favorite Tip Of The Week:
Here’s my favorite resource for the week.
OpenAI has released the best practices on using o-series models, primarily focusing on the three areas below:
Differences between reasoning and non-reasoning models
When to use reasoning models
How to approach prompting reasoning models effectively
Things to Know…
AI Action Summit was held in France to ensure the development of trusted, safe, and secure AI. The statement was signed by several countries, primarily focusing on the following priorities:
Promoting AI accessibility to reduce digital divides;
Ensuring AI is open, inclusive, transparent, ethical, safe, secure and trustworthy, taking into account international frameworks for all
Making innovation in AI thrive by enabling conditions for its development and avoiding market concentration driving industrial recovery and development
Encouraging AI deployment that positively shapes the future of work and labour markets and delivers opportunity for sustainable growth
Making AI sustainable for people and the planet
Reinforcing international cooperation to promote coordination in international governance
The Opportunity…
Podcast:
This week’s Open Tech Talks episode 154 is “Generative AI Risks and Governance: What Business Leaders Need to Know with Terry Ziemniak.” he is a Fractional CISO and Partner at TechCXO
Dify is an open-source LLM app development platform
And that’s a wrap!
Thank you, as always, for taking the time to read.
I would love to hear your thoughts. Please reply and share what you found most valuable this week. Your feedback means a lot to me.
Until next week,
Kashif Manzoor
The opinions expressed here are solely my conjecture based on experience, practice, and observation. They do not represent the thoughts, intentions, plans, or strategies of my current or previous employers or their clients/customers. The objective of this newsletter is to share and learn with the community.
We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept All”, you consent to the use of ALL the cookies. However, you may visit "Cookie Settings" to provide a controlled consent.
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
Cookie
Duration
Description
cookielawinfo-checkbox-analytics
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional
11 months
The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy
11 months
The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.