It sounds like you need to develop a structured schema for your data. That’s not just someone writing random stuff, there are field being filled and reported.
Extraction from the problem database directly into a JSON, or an AI transformation into an output schema, will unify the data and produce better semantic search results.
{
"name": "issue_report",
"schema": {
"type": "object",
"properties": {
"issue_summary": {
"type": "object",
"properties": {
"problem": {
"type": "string",
"description": "Description of the problem related to comment acceptance."
},
"character_limits": {
"type": "object",
"properties": {
"lokia_mobile": {
"type": "string",
"description": "Character limit allowed in Lokia Mobile."
},
"lokia_server_side": {
"type": "string",
"description": "Character limit retained in Lokia server-side."
}
},
"required": [
"lokia_mobile",
"lokia_server_side"
],
"additionalProperties": false
},
"impact": {
"type": "string",
"description": "Impact of the issue on the transmission to the SYNERGEE system."
}
},
"required": [
"problem",
"character_limits",
"impact"
],
"additionalProperties": false
},
"people_involved": {
"type": "object",
"properties": {
"reported_by": {
"type": "string",
"description": "Name of the person who reported the issue."
}
},
"required": [
"reported_by"
],
"additionalProperties": false
},
"action_items_identified": {
"type": "object",
"properties": {
"modification_needed_in_lokia": {
"type": "string",
"description": "Description of the modification needed in Lokia."
},
"assigned_person": {
"type": "string",
"description": "Name of the person assigned to the task."
},
"validation_required": {
"type": "string",
"description": "Description of what needs to be validated."
}
},
"required": [
"modification_needed_in_lokia",
"assigned_person",
"validation_required"
],
"additionalProperties": false
},
"suggested_next_steps": {
"type": "object",
"properties": {
"technical_fix": {
"type": "string",
"description": "Description of the technical fix needed."
},
"testing": {
"type": "array",
"description": "List of testing requirements.",
"items": {
"type": "string"
}
},
"deployment": {
"type": "string",
"description": "Description of the deployment process."
}
},
"required": [
"technical_fix",
"testing",
"deployment"
],
"additionalProperties": false
}
},
"required": [
"issue_summary",
"people_involved",
"action_items_identified",
"suggested_next_steps"
],
"additionalProperties": false
},
"strict": true
}
The problem still remains, if you want to answer “create a new problem report ONLY if nothing like this exists at all”.
You’d be able to answer “create a new problem report if the search finds nothing and would continue to find nothing for like inputs.”