GPT 3.5 calling 4o instead?

1treu1 · April 2, 2025, 5:06pm

I have the same problem. Can you resolve it?

_j · April 2, 2025, 5:27pm

You really have the same problem as a year and a half ago? Can you elaborate, because it’s working fine here. I just blasted off 100 parallel requests; they were fulfilled and billed, with no apparent increase in the usage counts of other models.

1treu1 · April 2, 2025, 6:09pm

Yes, I use LangChain:

# Imports and app setup were not in the original post; these are assumed for a recent LangChain release.
from flask import Flask, jsonify, request
from langchain.chains.question_answering import load_qa_chain
from langchain.memory import ConversationBufferMemory
from langchain.prompts import PromptTemplate
from langchain_community.vectorstores import FAISS
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

app = Flask(__name__)

@app.route("/getresponsegpt", methods=["GET"])
def getResponseGpt():
    user_prompt = request.args.get("user_prompt")
    embeddingGenerator = OpenAIEmbeddings()

    # download_embedding() is the poster's own helper; its definition is not shown here.
    download_embedding()

    PATH_VECTORSTORE = "PdfVectorStore"
    baseconocimiento = FAISS.load_local(
        PATH_VECTORSTORE + "/faiss_index",
        embeddingGenerator,
        allow_dangerous_deserialization=True
    )

    docs = baseconocimiento.similarity_search(user_prompt)
    # Save the messages in a folder per user
    # Save the messages in a .txt file per user, in the following form:
    ## <number/bot>, <time> Message
    # First access the debts Excel file in storage
    # Create the embeddings

    template = """
You are Emma, a virtual assistant expert in Spotify account administration. Your goal is to help with: subscriptions, payments, billing, technical problems, and use of the platform.

User context:
{context}

Conversation history:
{chat_history}

Guidelines:
1. Identify whether the query is related to Spotify. If not, reply: "Sorry, I can only answer questions related to Spotify. Can you try again?"
2. Check the subscription status before answering about premium features
3. For payment queries, confirm the latest recorded transactions
4. For technical problems, ask for device information and app version

Priorities:
- Keep a friendly and professional tone
- Offer step-by-step solutions when necessary
- Verify dates and the state of the conversation
- Include relevant links to the official documentation

Customer: {human_input}
Emma: """
    prompt = PromptTemplate(
        input_variables=["context", "chat_history", "human_input"],
        template=template
    )
    memory = ConversationBufferMemory(
        memory_key="chat_history",
        input_key="human_input"
    )

    # The model is requested explicitly here
    llm = ChatOpenAI(model_name="gpt-3.5-turbo")
    chain = load_qa_chain(
        llm,
        chain_type="stuff",
        memory=memory,
        prompt=prompt
    )

    respuesta = chain.invoke({
        'input_documents': docs,
        'human_input': user_prompt
    })
    print(respuesta['output_text'])

    # Build a serializable response
    response_data = {
        'answer': respuesta['output_text'],
        'source_documents': [
            {
                'page_content': doc.page_content,
                'metadata': doc.metadata
            } for doc in docs
        ]
    }

    return jsonify(response_data)

if __name__ == "__main__":
    app.run(host='0.0.0.0', port=8508, debug=True)

I call 3.5, but after many requests it gives me answers as if they came from the 4o or 4.5 models, and I don’t know why.

Look:

1treu1 · April 2, 2025, 6:10pm

[Image: a bar graph depicting "chatgpt-4o-latest" with 8,494 input tokens recorded on March 28, 2025. (Captioned by AI)]

_j · April 2, 2025, 6:25pm

I could blame some strange default or replacement going on in langchain, but chatgpt-4o-latest is a weird one to have invoked in any capacity.

I would log the parameters against the model returned in responses.

Then you can see if it is your code (or langchain’s) messing up, incorrect model fulfillment despite calling correctly, or simply a billing and accounting snafu.
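As a rough illustration of that logging idea, here is a minimal sketch in Python. It assumes each API response is available as a dict with a "model" field, as the Chat Completions response body has; the helper names `model_matches` and `audit_model` are made up for this example. The match rule (the same name, or the name plus a suffix starting with a digit, such as a dated snapshot) is only a heuristic and may need adjusting for other model families.

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("model_audit")


def model_matches(requested: str, returned: str) -> bool:
    """Heuristic: same name, or the requested alias plus a numeric/dated suffix."""
    if returned == requested:
        return True
    prefix = requested + "-"
    if returned.startswith(prefix):
        suffix = returned[len(prefix):]
        # e.g. "gpt-3.5-turbo" -> "gpt-3.5-turbo-0125" (suffix "0125")
        return suffix[:1].isdigit()
    return False


def audit_model(requested: str, response: dict) -> bool:
    """Log the requested model next to the model named in the response."""
    returned = response.get("model", "")
    ok = model_matches(requested, returned)
    level = logging.INFO if ok else logging.WARNING
    logger.log(level, "requested=%s returned=%s match=%s", requested, returned, ok)
    return ok


if __name__ == "__main__":
    # Hypothetical response bodies, for illustration only
    audit_model("gpt-3.5-turbo", {"model": "gpt-3.5-turbo-0125"})
    audit_model("gpt-3.5-turbo", {"model": "chatgpt-4o-latest"})
```

If the warnings never fire while the usage page still shows another model, the request path is probably fine and the discrepancy sits elsewhere (another key or project, or accounting). With LangChain, the returned model name is usually exposed in the message's response metadata, though where exactly depends on the version in use.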

(BTW, if using Azure, you can name a deployment anything you want and simulate the same…)
