Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

           256x256

           1024x1024

Response:

...

Sample Response Data

Code Block
{
"created": 1680145479,
"data": [
{
"url": "https://oaidalleapiprodscus.blob.core.windows.net/private/org-2FEbJIRL7GXfKmGw2BT9wh9b/user-nk6UUN7L9nFqzGEw67uTMonD/img-FhZpxMrCbiDBR4O62e7pPF08.png?st=2023-03-30T02%3A04%3A39Z&se=2023-03-30T04%3A04%3A39Z&sp=r&sv=2021-08-06&sr=b&rscd=inline&rsct=image/png&skoid=6aaadede-4fb3-4698-a8f6-684d7786b067&sktid=a48cca56-e6da-484e-a814-9c849652bcb3&skt=2023-03-29T17%3A40%3A49Z&ske=2023-03-30T17%3A40%3A49Z&sks=b&skv=2021-08-06&sig=4DF0dw/peG7FSVMUml4ShuQP98T0xECW1gE%2BeutdRAw%3D"
}
]
}

...

Url : The public URL for your image(s).

Best Practices:

Generating Images consume more computational power and hence the replies can be a delayed based on the prompts you give.

AI is a developing field and hence the images produced can be quite inaccurate given the complexity of the prompts provided. Hence finding the correct prompt complexity can sometimes be a challenge.

Speech to Text

Speech-to-text action is used when you want to convert audio input into text. This has variety of use cases such as implementation in IVRs.

Input:

...

File Url : This is the URL for the audio that you want to convert to text. Make sure that the url is a public hosted url ending with audio formats such as mp3. mpeg etc

Note

Please note the url need to start with https:// and end with mp3, or mp4, mpeg, mpga, m4a,wav, webm

Language : The language you want the speech to be converted into. We use ISO-639-1 format which means you need to put languages as ‘en’, ‘es’ etc.

Response:

...

Sample Response Data

Code Block
{
"text": "Welcome to Rensen. This is a test to see if everything works well. And if the IVR can guide you to your work."
}

...

Text : The text which is converted from the speech.

Best Practices:

You can convert speech to text quite accurately using this feature. It is considered a best practice to provie provide the audio in the same language as that of the desired output for more accurate results and latency.

Translate Audio to English

Translate audio to English action is used when you want to convert audio input into text in english language. This has a variety of use cases such as implementation in IVRs.

Input:

...

File Url : This is the URL for the audio that you want to convert to text. Make sure that the url is a public hosted url ending with audio formats such as mp3. mpeg etc

Note

Please note the url need to start with https:// and end with mp3, or mp4, mpeg, mpga, m4a,wav, webm

Response:

...

Sample Response Data

Code Block
{
"text": "Welcome to Rensen. This is a test to see if everything works well. And if the IVR can guide you to your work."
}

...

Text: The text which is converted from the speech.

Best Practices:

Experimenting with different formats of audio can provide more (or less) accurate results. This is simply due to the quality of the audio provided so make sure you split test with different formats to achieve the best possible format for your use case.

Create Chat Completion

Chat completion offers a functionality of sending prompts to OpenAI in textual form and based on the prompt receive an answer. This is similar to text completion action but it uses ChatGPT which is 10x faster and cheaper as well.

Input:

...

System Message : This is an optional field and is used to provide additional context about you or your business when completing chats.

...