Enhanced Inference

Last updated: February 8, 2026. Reviewed: February 8, 2026. Reading time: 13 min.


Definition

autogen.OpenAIWrapper provides enhanced LLM inference for openai>=1. autogen.Completion is a drop-in replacement for openai.Completion and openai.ChatCompletion that provides enhanced LLM inference with openai<1. Using autogen for inference brings a number of benefits: performance tuning, API unification, caching, error handling, multi-config inference, result filtering, templating and so on.

Tune Inference Parameters (for openai<1)

Find a list of examples in this page: Tune Inference Parameters Examples

Choices to optimize

The cost of using foundation models for text generation is typically measured in terms of the number of tokens in the input and output combined. From the perspective of an application builder using foundation models, the use case is to maximize the utility of the generated text under an inference budget constraint (e.g., measured by the average dollar cost needed to solve a coding problem). This can be achieved by optimizing the hyperparameters of the inference, which can significantly affect both the utility and the cost of the generated text.

The tunable hyperparameters include:

model – this is a required input, specifying the model ID to use.
prompt/messages – the input prompt/messages to the model, which provides the context for the text generation task.
max_tokens – the maximum number of tokens (words or word pieces) to generate in the output.
temperature – a non-negative value, typically between 0 and 1 (the OpenAI API accepts values up to 2), that controls the randomness of the generated text. A higher temperature will result in more random and diverse text, while a lower temperature will result in more predictable text.
top_p – a value between 0 and 1 that controls the sampling probability mass for each token generation. A lower top_p value will make it more likely to generate text based on the most likely tokens, while a higher value will allow the model to explore a wider range of possible tokens.
n – the number of responses to generate for a given prompt. Generating multiple responses can provide more diverse and potentially more useful output, but it also increases the cost of the request.
stop – a list of strings that, when encountered in the generated text, will cause the generation to stop. This can be used to control the length or the validity of the output.
presence_penalty, frequency_penalty – values that control the relative importance of the presence and frequency of certain words or phrases in the generated text.
best_of – the number of responses to generate server-side when selecting the “best” (the one with the highest log probability per token) response for a given prompt.

The cost and utility of text generation depend on the joint effect of these hyperparameters, and subsets of them interact in complex ways. For example, changing both temperature and top_p from their default values is not recommended, because both control the randomness of the generated text and changing them together can produce conflicting effects; n and best_of are rarely tuned together, because if the application can process multiple outputs, filtering on the server side causes unnecessary information loss; and both n and max_tokens affect the total number of tokens generated, which in turn affects the cost of the request. These interactions and trade-offs make it difficult to determine the optimal hyperparameter settings manually for a given text generation task.

Do the choices matter? Check this blogpost to find example tuning results about gpt-3.5-turbo and gpt-4.

With AutoGen, the tuning can be performed with the following information:

Validation data.
Evaluation function.
Metric to optimize.
Search space.
Budgets: inference and optimization respectively.

Validation data

Collect a diverse set of instances. They can be stored in an iterable of dicts. For example, each instance dict can contain “problem” as a key and the description str of a math problem as the value; and “solution” as a key and the solution str as the value.
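As a minimal sketch of the layout described above, the validation data could look like the following. The two problems are made up for illustration and do not come from any dataset.

```python
# Hypothetical validation data: an iterable of dicts, one per instance.
# The keys ("problem", "solution") match the example described above;
# the problems themselves are made up for illustration.
tune_data = [
    {"problem": "What is 2 + 3?", "solution": "5"},
    {"problem": "What is the smallest prime number?", "solution": "2"},
]

# Every instance should expose the same keys, since they are passed
# to the evaluation function as keyword arguments.
assert all(set(d) == {"problem", "solution"} for d in tune_data)
```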

Evaluation function

The evaluation function should take a list of responses, and other keyword arguments corresponding to the keys in each validation data instance as input, and output a dict of metrics. For example,

from typing import Dict, List

def eval_math_responses(responses: List[str], solution: str, **args) -> Dict:
    # select a response from the list of responses
    # (voted_answer and is_equivalent are placeholder helpers, not defined here)
    answer = voted_answer(responses)
    # check whether the answer is correct
    return {"success": is_equivalent(answer, solution)}

autogen.code_utils and autogen.math_utils offer some example evaluation functions for code generation and math problem solving.
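To make the evaluation function above concrete, here is a self-contained sketch. The helpers voted_answer and is_equivalent are hypothetical stand-ins written for this example (a simple majority vote and an exact string comparison); they are not the implementations shipped in autogen.math_utils.

```python
from collections import Counter
from typing import Dict, List


def voted_answer(responses: List[str]) -> str:
    """Hypothetical helper: return the most common response (majority vote)."""
    normalized = [r.strip() for r in responses]
    return Counter(normalized).most_common(1)[0][0]


def is_equivalent(answer: str, solution: str) -> bool:
    """Hypothetical helper: compare answers after trimming whitespace."""
    return answer.strip() == solution.strip()


def eval_math_responses(responses: List[str], solution: str, **args) -> Dict:
    # Select one answer by majority vote, then check it against the solution.
    # Extra keys of the data instance (e.g. "problem") arrive in **args.
    answer = voted_answer(responses)
    return {"success": is_equivalent(answer, solution)}


result = eval_math_responses(["5", " 5", "6"], solution="5", problem="What is 2 + 3?")
print(result)  # {'success': True}
```

A real math evaluator would likely need a more tolerant equivalence check (for example, treating "0.5" and "1/2" as equal); exact matching is used here only to keep the sketch short.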

Metric to optimize

The metric to optimize is usually an aggregated metric over all the tuning data instances. For example, users can specify “success” as the metric and “max” as the optimization mode. By default, the aggregation function takes the average. Users can provide a customized aggregation function if needed.
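The averaging described above can be sketched as follows. The function name average_metrics is hypothetical and this is not AutoGen's internal code; it only illustrates how per-instance metric dicts reduce to one aggregated value per metric.

```python
from typing import Dict, List


def average_metrics(results: List[Dict[str, float]]) -> Dict[str, float]:
    """Average each metric over all instances (a sketch of averaging aggregation)."""
    keys = results[0].keys()
    return {k: sum(float(r[k]) for r in results) / len(results) for k in keys}


# Made-up per-instance results, as returned by an evaluation function.
per_instance = [{"success": True}, {"success": False}, {"success": True}, {"success": True}]
aggregated = average_metrics(per_instance)
print(aggregated)  # {'success': 0.75}
```

With “success” as the metric and “max” as the mode, a configuration scoring 0.75 would be preferred over one scoring 0.5.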

Search space

Users can specify the (optional) search range for each hyperparameter.

model. Either a constant str, or multiple choices specified by flaml.tune.choice.
prompt/messages. Prompt is either a str or a list of strs, of the prompt templates. messages is a list of dicts or a list of lists, of the message templates. Each prompt/message template will be formatted with each data instance. For example, the prompt template can be: “{problem} Solve the problem carefully. Simplify your answer as much as possible. Put the final answer in \boxed{{}}.” And {problem} will be replaced by the “problem” field of each data instance.
max_tokens, n, best_of. They can be constants, or specified by flaml.tune.randint, flaml.tune.qrandint, flaml.tune.lograndint or flaml.tune.qlograndint. By default, max_tokens is searched in [50, 1000); n is searched in [1, 100); and best_of is fixed to 1.
stop. It can be a str or a list of strs, or a list of lists of strs or None. Default is None.
temperature or top_p. One of them can be specified as a constant or by flaml.tune.uniform or flaml.tune.loguniform etc. Please don’t provide both. By default, each configuration will choose either a temperature or a top_p in [0, 1] uniformly.
presence_penalty, frequency_penalty. They can be constants or specified by flaml.tune.uniform etc. Not tuned by default.
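The prompt template mentioned in the list above relies on Python format-string rules. The sketch below uses plain str.format on a made-up data instance to show how {problem} is filled in and why the braces after \boxed are doubled. It illustrates the formatting behavior only, not AutoGen's internals.

```python
# The prompt template from the example above. Doubled braces "{{}}" are
# escaped, so str.format leaves a literal "{}" after \boxed.
template = (
    "{problem} Solve the problem carefully. Simplify your answer as much as "
    "possible. Put the final answer in \\boxed{{}}."
)

instance = {"problem": "What is 2 + 3?", "solution": "5"}  # made-up instance

# Extra keys such as "solution" are ignored by str.format.
prompt = template.format(**instance)
print(prompt)
```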

Budgets

One can specify an inference budget and an optimization budget. The inference budget refers to the average inference cost per data instance. The optimization budget refers to the total budget allowed in the tuning process. Both are measured in dollars, based on the price per 1000 tokens.
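The arithmetic behind the two budgets can be sketched as below. The prices and token counts are hypothetical values chosen for illustration, and completion_cost is a helper written for this example, not an AutoGen function.

```python
def completion_cost(prompt_tokens: int, completion_tokens: int,
                    price_per_1k_prompt: float, price_per_1k_completion: float) -> float:
    """Dollar cost of one request, given prices per 1000 tokens."""
    return (prompt_tokens * price_per_1k_prompt
            + completion_tokens * price_per_1k_completion) / 1000


# Hypothetical prices and token counts, chosen only for illustration.
costs = [
    completion_cost(400, 600, price_per_1k_prompt=0.03, price_per_1k_completion=0.06),
    completion_cost(500, 250, price_per_1k_prompt=0.03, price_per_1k_completion=0.06),
]

inference_budget = 0.05      # average dollars allowed per data instance
optimization_budget = 3.0    # total dollars allowed for the whole tuning run

average_cost = sum(costs) / len(costs)
within_inference_budget = average_cost <= inference_budget
within_optimization_budget = sum(costs) <= optimization_budget
print(within_inference_budget, within_optimization_budget)
```

Here the two requests cost about $0.048 and $0.03, so the average of about $0.039 per instance stays under the $0.05 inference budget.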

Perform tuning

Now, you can use autogen.Completion.tune for tuning. For example,

import autogen

config, analysis = autogen.Completion.tune(
    data=tune_data,
    metric="success",
    mode="max",
    eval_func=eval_func,
    inference_budget=0.05,
    optimization_budget=3,
    num_samples=-1,
)

num_samples is the number of configurations to sample. -1 means unlimited (until optimization budget is exhausted). The returned config contains the optimized configuration and analysis contains an ExperimentAnalysis object for all the tried configurations and results.

The tuned config can be used to perform inference.

API unification

autogen.OpenAIWrapper.create() can be used to create completions for both chat and non-chat models, and both OpenAI API and Azure OpenAI API.

from autogen import OpenAIWrapper
# OpenAI endpoint
client = OpenAIWrapper()
# ChatCompletion
response = client.create(messages=[{"role": "user", "content": "2+2="}], model="gpt-3.5-turbo")
# extract the response text
print(client.extract_text_or_completion_object(response))
# get cost of this completion
print(response.cost)
# Azure OpenAI endpoint
client = OpenAIWrapper(api_key=..., base_url=..., api_version=..., api_type="azure")
# Completion
response = client.create(prompt="2+2=", model="gpt-3.5-turbo-instruct")
# extract the response text
print(client.extract_text_or_completion_object(response))

For local LLMs, one can spin up an endpoint using a package like FastChat, and then use the same API to send a request. See here for examples on how to make inference with local LLMs.

Usage Summary

The OpenAIWrapper from autogen tracks token counts and costs of your API calls. Use the create() method to initiate requests and print_usage_summary() to retrieve a detailed usage report, including total cost and token usage for both cached and actual requests.

mode=["actual", "total"] (default): print usage summary for all completions and non-caching completions.
mode='actual': only print non-cached usage.
mode='total': only print all usage (including cache).

Reset your session’s usage data with clear_usage_summary() when needed.

Example usage:

from autogen import OpenAIWrapper

client = OpenAIWrapper()
client.create(messages=[{"role": "user", "content": "Python learning tips."}], model="gpt-3.5-turbo")
client.print_usage_summary()  # Display usage
client.clear_usage_summary()  # Reset usage data

Sample output:

Usage summary excluding cached usage:
Total cost: 0.00015
* Model 'gpt-3.5-turbo': cost: 0.00015, prompt_tokens: 25, completion_tokens: 58, total_tokens: 83

Usage summary including cached usage:
Total cost: 0.00027
* Model 'gpt-3.5-turbo': cost: 0.00027, prompt_tokens: 50, completion_tokens: 100, total_tokens: 150

Caching

API call results are cached locally and reused when the same request is issued. This is useful when repeating or continuing experiments for reproducibility and cost saving. It still allows controlled randomness by setting the “cache_seed” specified in OpenAIWrapper.create() or the constructor of OpenAIWrapper.

client = OpenAIWrapper(cache_seed=...)
client.create(...)

client = OpenAIWrapper()
client.create(cache_seed=..., ...)

Caching is enabled by default with cache_seed 41. To disable it please set cache_seed to None.

NOTE. openai v1.1 introduces a new param seed. The difference between autogen’s cache_seed and openai’s seed is that:

autogen uses a local disk cache to guarantee that exactly the same output is produced for the same input. When the cache is hit, no openai api call is made.
openai’s seed is best-effort deterministic sampling with no guarantee of determinism. When using openai’s seed with cache_seed set to None, an openai api call is made even for the same input, and there is no guarantee of getting exactly the same output.
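The role of cache_seed can be illustrated with a toy, in-memory stand-in. The class SeededCache and its backend function are hypothetical and exist only for this sketch; AutoGen itself caches on local disk, and its cache key layout may differ.

```python
import json
from typing import Callable, Dict, Optional


class SeededCache:
    """Toy in-memory stand-in for a seed-keyed response cache (hypothetical)."""

    def __init__(self, backend: Callable[[dict], str]):
        self.backend = backend        # function that performs the real call
        self.store: Dict[str, str] = {}
        self.backend_calls = 0

    def create(self, request: dict, cache_seed: Optional[int] = 41) -> str:
        if cache_seed is None:        # caching disabled: always call the backend
            self.backend_calls += 1
            return self.backend(request)
        # The key combines the seed with the full request contents.
        key = json.dumps({"seed": cache_seed, "request": request}, sort_keys=True)
        if key not in self.store:
            self.backend_calls += 1
            self.store[key] = self.backend(request)
        return self.store[key]


cache = SeededCache(backend=lambda req: f"answer to {req['prompt']}")
cache.create({"prompt": "2+2="})                   # miss: backend called
cache.create({"prompt": "2+2="})                   # hit: served from the cache
cache.create({"prompt": "2+2="}, cache_seed=42)    # new seed: backend called again
cache.create({"prompt": "2+2="}, cache_seed=None)  # disabled: backend called
print(cache.backend_calls)  # 3
```

Changing the seed gives a fresh cache entry for the same request, which is how repeated runs can stay reproducible while still allowing controlled variation.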

Error handling

Runtime error

One can pass a list of configurations of different models/endpoints to mitigate rate limits and other runtime errors. For example,

import os

from autogen import OpenAIWrapper

client = OpenAIWrapper(
    config_list=[
        {
            "model": "gpt-4",
            "api_key": os.environ.get("AZURE_OPENAI_API_KEY"),
            "api_type": "azure",
            "base_url": os.environ.get("AZURE_OPENAI_API_BASE"),
            "api_version": "2023-08-01-preview",
        },
        {
            "model": "gpt-3.5-turbo",
            "api_key": os.environ.get("OPENAI_API_KEY"),
            "base_url": "https://api.openai.com/v1",
        },
        {
            "model": "llama2-chat-7B",
            "base_url": "http://127.0.0.1:8080",
        }
    ],
)

client.create() will try querying Azure OpenAI gpt-4, OpenAI gpt-3.5-turbo, and a locally hosted llama2-chat-7B one by one, until a valid result is returned. This can speed up the development process where the rate limit is a bottleneck. An error will be raised if the last choice fails. So make sure the last choice in the list has the best availability.
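The fallback behavior described above can be sketched without any network access. Both create_with_fallback and fake_call are hypothetical names written for this example; RuntimeError stands in for whatever rate-limit or timeout errors a real client would raise.

```python
from typing import Callable, Dict, List


def create_with_fallback(config_list: List[Dict], call: Callable[[Dict], str]) -> str:
    """Try each config in order; return the first successful result."""
    last_error = None
    for config in config_list:
        try:
            return call(config)
        except RuntimeError as err:   # stand-in for rate-limit or timeout errors
            last_error = err
    if last_error is None:
        raise ValueError("config_list is empty")
    raise last_error                  # every config failed: surface the last error


def fake_call(config: Dict) -> str:
    """Hypothetical endpoint: only the local model is available."""
    if config["model"] != "llama2-chat-7B":
        raise RuntimeError(f"{config['model']} is rate limited")
    return f"response from {config['model']}"


configs = [{"model": "gpt-4"}, {"model": "gpt-3.5-turbo"}, {"model": "llama2-chat-7B"}]
result = create_with_fallback(configs, fake_call)
print(result)  # response from llama2-chat-7B
```

Because the error from the last config is the one that propagates, placing the most available endpoint last keeps failures rare.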

For convenience, we provide a number of utility functions to load config lists.

get_config_list: Generates configurations for API calls, primarily from provided API keys.
config_list_openai_aoai: Constructs a list of configurations using both Azure OpenAI and OpenAI endpoints, sourcing API keys from environment variables or local files.
config_list_from_json: Loads configurations from a JSON structure, either from an environment variable or a local JSON file, with the flexibility of filtering configurations based on given criteria.
config_list_from_models: Creates configurations based on a provided list of models, useful when targeting specific models without manually specifying each configuration.
config_list_from_dotenv: Constructs a configuration list from a .env file, offering a consolidated way to manage multiple API configurations and keys from a single file.

We suggest that you take a look at this notebook for full code examples of the different methods to configure your model endpoints.

Logic error

Another type of error is that the returned response does not satisfy a requirement. For example, if the response is required to be a valid json string, one would like to filter the responses that are not. This can be achieved by providing a list of configurations and a filter function. For example,

import json

from autogen import OpenAIWrapper

def valid_json_filter(response, **_):
    for text in OpenAIWrapper.extract_text_or_completion_object(response):
        try:
            json.loads(text)
            return True
        except ValueError:
            pass
    return False

client = OpenAIWrapper(
    config_list=[{"model": "text-ada-001"}, {"model": "gpt-3.5-turbo-instruct"}, {"model": "text-davinci-003"}],
)
response = client.create(
    prompt="How to construct a json request to Bing API to search for 'latest AI news'? Return the JSON request.",
    filter_func=valid_json_filter,
)

The example above will try text-ada-001, gpt-3.5-turbo-instruct, and text-davinci-003 in turn, until a valid json string is returned or the last config is used. One can also repeat the same model in the list multiple times (with different seeds) to try that model several times and increase the robustness of the final response.
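The filtering loop can be sketched on plain strings. The helper names and the three outputs are made up for this example, and returning the last response when none passes the filter is an assumption of the sketch rather than documented behavior.

```python
import json
from typing import Callable, List, Optional


def is_valid_json(text: str) -> bool:
    """Return True when the text parses as JSON."""
    try:
        json.loads(text)
        return True
    except ValueError:
        return False


def first_accepted(candidates: List[str], accept: Callable[[str], bool]) -> Optional[str]:
    """Return the first response passing the filter, else the last one tried."""
    response = None
    for response in candidates:
        if accept(response):
            return response
    return response


# Made-up outputs standing in for three models tried in order.
outputs = ["Sure! Here is the request:", '{"q": "latest AI news"', '{"q": "latest AI news"}']
chosen = first_accepted(outputs, is_valid_json)
print(chosen)  # {"q": "latest AI news"}
```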

Advanced use case: Check this blogpost to find how to improve GPT-4’s coding performance from 68% to 90% while reducing the inference cost.

Templating

If the provided prompt or message is a template, it will be automatically materialized with a given context. For example,

response = client.create(
    context={"problem": "How many positive integers, not exceeding 100, are multiples of 2 or 3 but not 4?"},
    prompt="{problem} Solve the problem carefully.",
    allow_format_str_template=True,
    **config
)

A template is either a format str, like the example above, or a function which produces a str from several input fields, like the example below.

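from functools import partial

# context comes first so that partial(content, turn=...) can be called with the context alone
def content(context, turn):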
    return "\n".join(
        [
            context[f"user_message_{turn}"],
            context[f"external_info_{turn}"]
        ]
    )

messages = [
    {
        "role": "system",
        "content": "You are a teaching assistant of math.",
    },
    {
        "role": "user",
        "content": partial(content, turn=0),
    },
]
context = {
    "user_message_0": "Could you explain the solution to Problem 1?",
    "external_info_0": "Problem 1: ...",
}

response = client.create(context=context, messages=messages, **config)
messages.append(
    {
        "role": "assistant",
        "content": client.extract_text_or_completion_object(response)[0]
    }
)
messages.append(
    {
        "role": "user",
        "content": partial(content, turn=1),
    },
)
# context is a dict, so new fields are merged in with update()
context.update(
    {
        "user_message_1": "Why can't we apply Theorem 1 to Equation (2)?",
        "external_info_1": "Theorem 1: ...",
    }
)
response = client.create(context=context, messages=messages, **config)
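The two kinds of template can be handled by one small function, sketched below. The name materialize is hypothetical, and the sketch assumes a callable template receives the context as its only positional argument, which is why the helper takes context before turn. It also ignores the allow_format_str_template switch for brevity.

```python
from functools import partial
from typing import Callable, Dict, Union

Template = Union[str, Callable[[Dict], str]]


def materialize(template: Template, context: Dict) -> str:
    """Fill a template: call it if callable, else treat it as a format str."""
    if callable(template):
        return template(context)
    return template.format(**context)


def content(context: Dict, turn: int) -> str:
    # Join the user message and the external info for the given turn.
    return "\n".join([context[f"user_message_{turn}"], context[f"external_info_{turn}"]])


context = {
    "problem": "How many primes are below 10?",  # made-up problem
    "user_message_0": "Could you explain the solution to Problem 1?",
    "external_info_0": "Problem 1: ...",
}

print(materialize("{problem} Solve the problem carefully.", context))
print(materialize(partial(content, turn=0), context))
```

A function template is useful when the text depends on several context fields or on the turn number, which a single format str cannot express as easily.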

Logging (for openai<1)

When debugging or diagnosing an LLM-based system, it is often convenient to log the API calls and analyze them. autogen.Completion and autogen.ChatCompletion offer an easy way to collect the API call histories. For example, to log the chat histories, simply run:

autogen.ChatCompletion.start_logging()

The API calls made after this will be automatically logged. They can be retrieved at any time by:

autogen.ChatCompletion.logged_history

There is a function that can be used to print usage summary (total cost, and token count usage from each model):

autogen.ChatCompletion.print_usage_summary()

To stop logging, use

autogen.ChatCompletion.stop_logging()

If one would like to append the history to an existing dict, pass the dict like:

autogen.ChatCompletion.start_logging(history_dict=existing_history_dict)

By default, the counter of API calls will be reset at start_logging(). If no reset is desired, set reset_counter=False.

There are two types of logging formats: compact logging and individual API call logging. The default format is compact. Set compact=False in start_logging() to switch.

Example of a history dict with compact logging.

{
    """
    [
        {
            'role': 'system',
            'content': system_message,
        },
        {
            'role': 'user',
            'content': user_message_1,
        },
        {
            'role': 'assistant',
            'content': assistant_message_1,
        },
        {
            'role': 'user',
            'content': user_message_2,
        },
        {
            'role': 'assistant',
            'content': assistant_message_2,
        },
    ]""": {
        "created_at": [0, 1],
        "cost": [0.1, 0.2],
    }
}

Example of a history dict with individual API call logging.

{
    0: {
        "request": {
            "messages": [
                {
                    "role": "system",
                    "content": system_message,
                },
                {
                    "role": "user",
                    "content": user_message_1,
                }
            ],
            ... # other parameters in the request
        },
        "response": {
            "choices": [
                "messages": {
                    "role": "assistant",
                    "content": assistant_message_1,
                },
            ],
            ... # other fields in the response
        }
    },
    1: {
        "request": {
            "messages": [
                {
                    "role": "system",
                    "content": system_message,
                },
                {
                    "role": "user",
                    "content": user_message_1,
                },
                {
                    "role": "assistant",
                    "content": assistant_message_1,
                },
                {
                    "role": "user",
                    "content": user_message_2,
                },
            ],
            ... # other parameters in the request
        },
        "response": {
            "choices": [
                "messages": {
                    "role": "assistant",
                    "content": assistant_message_2,
                },
            ],
            ... # other fields in the response
        }
    },
}

Example of the printed usage summary.

Total cost: <cost>
Token count summary for model <model>: prompt_tokens: <count 1>, completion_tokens: <count 2>, total_tokens: <count 3>

Note that the individual API call history contains redundant information about the conversation. For a long conversation the degree of redundancy is high. The compact history is more efficient, while the individual API call history contains more details.
    
    
    
        
            🧭
            
                Safe pathway to proper treatment
                Care roadmap for: Enhanced Inference
                Use this simple roadmap to understand the next safe steps. It is educational and does not replace examination by a doctor.
            
        
        
            Go to emergency care if you notice:
            Severe or rapidly worsening symptoms
Breathing difficulty, chest pain, fainting, confusion, severe weakness, major injury, or severe dehydration
        
        Doctor / service to discuss: Qualified healthcare provider; specialist depends on symptoms and examination.
        
                            
                    Step 1
                    Check danger signs first
                    If danger signs are present, seek emergency care and do not wait for online information.
                
                            
                    Step 2
                    Record the symptom story
                    Write when symptoms started, severity, medicines already taken, allergies, pregnancy status, and test results.
                
                            
                    Step 3
                    Visit a qualified clinician
                    A doctor, nurse, or qualified healthcare provider can examine you and decide which tests or treatment are needed.
                
                            
                    Step 4
                    Do only useful tests
                    Do tests after clinical assessment. Avoid unnecessary tests, random antibiotics, or repeated medicines without diagnosis.
                
                            
                    Step 5
                    Follow up and return early if worse
                    If symptoms worsen, new warning signs appear, or treatment is not helping, return for review quickly.
                
                    
        
            Rural patient practical tips
            Take a written symptom diary and all previous prescriptions/test reports.
Do not hide medicines already taken, even herbal or over-the-counter medicines.
Ask which warning signs mean urgent referral to hospital.
        
        This roadmap is for education. A real diagnosis and treatment plan requires history, examination, and clinical judgment.
    
    
Article writer: Dr. Harun Ar Rashid, MD - Arthritis, Bones, Joints Pain, Trauma, and Internal Medicine Specialist (Medical writer)
Medical reviewer: Team RxHarun (Specialist Doctor In This Topic)
First published: January 12, 2024
Last updated: February 8, 2026
Medically reviewed on: February 8, 2026
Next planned update: February 8, 2027
Fact-check note: Reviewed for medical accuracy, clarity, and patient safety.
    
        