2025 - Renise Black

2025-04-162025-04-22

Apprends Mardi 04-15

Je n’ai rien appris beaucoup de nouveaux mots aujourd’hui, mais j’ai appris quelque choses de grammaire. Je sais maintenant que l’ordre de plusieurs adjectifs avant le nom est très complexe. Il n’y pas des règles précises quand il y a plusieurs adjectifs. J’ai aussi appris comment utiliser un adverbe dans le passé composé.

French	English	Notes
Je <<vais bientôt terminer>> ce livre	I am going to finish this book soon.	Notice the adverb placement

Dernier Articles

2025-04-152025-05-03

Google Gen AI 5-Day Intensive: Day One (1/5)

Codelab 1/2 from day one. The code lab is here in Kaggle and you can download it to run locally on Github.

"""
Evaluation and Structured Output

Google Gen AI 5-Day Intensive Course
Host: Kaggle

Day: 1

Kaggle: https://www.kaggle.com/code/markishere/day-1-evaluation-and-structured-output
"""

import enum
import os

from google import genai
from google.api_core import retry
from google.genai import types
from IPython.display import Markdown, display

client = genai.Client(api_key=os.environ["GOOGLE_API_KEY"])

# Automated retry
is_retriable = lambda e: (
    isinstance(e, genai.errors.APIError) and e.code in {429, 503}
)

genai.models.Models.generate_content = retry.Retry(predicate=is_retriable)(
    genai.models.Models.generate_content
)

# if not hasattr(genai.models.Models.generate_content, '__wrapped__'):
#     genai.models.Models.generate_content = retry.Retry(
#         predicate=is_retriable)(genai.models.Models.generate_content)

# Evaluation
# Understand model performance
# Get the file locally first
# !wget -nv -O gemini.pdf https://storage.googleapis.com/cloud-samples-data/generative-ai/pdf/2403.05530.pdf
document_file = client.files.upload(file="/Users/renise/Documents/Python/gen_ai/day_one/gemini.pdf")
print("\n")
print(document_file)
print("\n")

print("\nSummarize a document\n")


# Summarize a document
def summarize_doc(request: str) -> str:
    """Execute the request on the uploaded document."""
    # Set the temperature low to stabilize the output.
    config = types.GenerateContentConfig(temperature=0.0)
    response = client.models.generate_content(
        model="gemini-2.0-flash",
        config=config,
        contents=[request, document_file],
    )
    return response.text


request = "Tell me about the training process used here."
summary = summarize_doc(request)
# display(Markdown(summary + "\n-----"))
print("\n\n")


# Define an evaluator
SUMMARY_PROMPT = """\
# Instruction
You are an expert evaluator. Your task is to evaluate the quality of the responses generated by AI models.
We will provide you with the user input and an AI-generated responses.
You should first read the user input carefully for analyzing the task, and then evaluate the quality of the responses based on the Criteria provided in the Evaluation section below.
You will assign the response a rating following the Rating Rubric and Evaluation Steps. Give step-by-step explanations for your rating, and only choose ratings from the Rating Rubric.

# Evaluation
## Metric Definition
You will be assessing summarization quality, which measures the overall ability to summarize text. Pay special attention to length constraints, such as in X words or in Y sentences. The instruction for performing a summarization task and the context to be summarized are provided in the user prompt. The response should be shorter than the text in the context. The response should not contain information that is not present in the context.

## Criteria
Instruction following: The response demonstrates a clear understanding of the summarization task instructions, satisfying all of the instruction's requirements.
Groundedness: The response contains information included only in the context. The response does not reference any outside information.
Conciseness: The response summarizes the relevant details in the original text without a significant loss in key information without being too verbose or terse.
Fluency: The response is well-organized and easy to read.

## Rating Rubric
5: (Very good). The summary follows instructions, is grounded, is concise, and fluent.
4: (Good). The summary follows instructions, is grounded, concise, and fluent.
3: (Ok). The summary mostly follows instructions, is grounded, but is not very concise and is not fluent.
2: (Bad). The summary is grounded, but does not follow the instructions.
1: (Very bad). The summary is not grounded.

## Evaluation Steps
STEP 1: Assess the response in aspects of instruction following, groundedness, conciseness, and verbosity according to the criteria.
STEP 2: Score based on the rubric.

# User Inputs and AI-generated Response
## User Inputs

### Prompt
{prompt}

## AI-generated Response
{response}
"""


# Define a structured enum class to capture the result.
class SummaryRating(enum.Enum):
    VERY_GOOD = 5
    GOOD = 4
    OK = 3
    BAD = 2
    VERY_BAD = 1


def eval_summary(prompt, ai_response):
    """Evaluate the generated summary against the prompt."""

    chat = client.chats.create(model="gemini-2.0-flash")

    # Generate the full text response
    response = chat.send_message(
        message=SUMMARY_PROMPT.format(prompt=prompt, response=ai_response)
    )
    verbose_eval = response.text

    # Coerce into desired structure
    structured_output_config = types.GenerateContentConfig(
        response_mime_type="text/x.enum", 
        response_schema=SummaryRating
    )
    response = chat.send_message(
        message="Convert the final score.", 
        config=structured_output_config
    )
    structured_eval = response.parsed

    return verbose_eval, structured_eval


text_eval, struct_eval = eval_summary(
    prompt=[request, document_file], 
    ai_response=summary
)
Markdown(text_eval)

# Play with the summary prompt
new_prompt = "Explain like I'm 5 the training process"

# Try:
#  ELI5 the training process
#  Summarise the needle/haystack evaluation technique in 1 line
#  Describe the model architecture to someone with a civil engineering degree
#  What is the best LLM?
if not new_prompt:
    raise ValueError("Try setting a new summarization prompt.")


def run_and_eval_summary(prompt):
    """Generate and evaluate the summary using the new prompt."""
    summary = summarize_doc(new_prompt)
    display(Markdown(summary + "\n-----"))

    text, struct = eval_summary([new_prompt, document_file], summary)
    display(Markdown(text + "\n-----"))
    print(struct)


run_and_eval_summary(new_prompt)

Apprends Jeudi 04.10

CEFR: 36

Aujourd’hui, j’ai appris beaucoup de choses. J’ai appris la ville de Paris. J’ai appris aussi <<voudrions>>, <<le plus proche>>, <<faire pour aller là-bas>>, <<rompre>>, <<déménager>>, <<vendre de commencer>>, <<vous vous retournez>>, <<grandir>>, <<la bonne taille>>, et autres en bas.

French	English	Notes
Où est le café <<le plus proche?>>	Where is <<the closest>> cafe
Excusez-moi, je suis <<perdue>>	Excuse me, I am <<lost>>
Nous <<voudrions>> aller à la gare	We <<would like>> to go to the train station
Comment est-ce que je <<fais pour aller là-bas>>?	How do I get over there?	This is a common saying I should know
Il y a <<tellement de>> voitures	There are <<so many>> cars	The use of “de” for singular and plural
Qui a <<le plan>>?	Who has <<the map>>
Berlin est <<une capitale>>	Berlin is a <<capital city>>	Notice the gender
J’habite dans <<le centre-ville>>	I live in <<the city center>>
Si <<tu te retournes>> je vais prendre une photo>>	If <<you turn (yourself) around>> I am going to take a photo
Il ne faut pas traverser ici	One must not cross here	Something important to understand
<<Tu romps>> avec moi?	Are you breaking up with me	rompre (v): to break up
Ca <<ne vas pas être>> long	It won’t be long
Mais, <<nous devons rompre>>	But, we have to break up
Je dois déménager pour le travail	I have to move (out) for work
Et, je n’ai <<que quelques jours de>> vacances per an.	And, I don’t have <<many>> vacations each year
Alors on ne rompt pas?	Then we don’t break up
discute	discuss	discuter (v)
Si <<vous vous retournez>>, je vais prendre une photo	See above
La réunion <<vient de commencer>>	The meeting <<is about to start>>
J’<<ai raison>>, et tu <<as tort>>	I <<am right>>, and tu <<are wrong>>
Elle va grandir>>	She is going <<to grow up>>	grandir: grow, grow up
Je vais le remplir avec de l’eau	I am going <<to fill it>> (the glass) with water
Il fait choisir <<la bonne taille>>	One must chose <<the right size>>
Il est <<neuf>>	It is <<new>>	neuf (adj): nine, new, fresh, brand new, mint condition
Je voudrais des <<vêtements neufs>>	I would like new clothes
Ce n’est pas <<la bonne taille>>	That’s not <<the right size>>
J’ai choisi ces chaussures parce qu’elles sont <<bon marché>>	I chose these shoes because they are <<inexpensive>>	Compared to saying “pas cher” or “not expensive”
J’<<ai grandi>> à Californie	I <<grew up>> in California

Tag: 2025

Apprends Mardi 04-15

Dernier Articles

Google Gen AI 5-Day Intensive: Day One (1/5)

Recent Posts

Apprends Jeudi 04.10