Rate Limits
Rate limiting serves multiple purposes:
Security: It mitigates the risk of abuse or misuse by preventing malicious users from flooding the API with excessive requests, which could disrupt services.
Fair Access: Rate limiting ensures equitable access to the API by restricting any single user or application from monopolizing resources, enabling all users to leverage the API effectively.
Performance Optimization: By managing the overall load on our infrastructure, rate limiting prevents perfFew readersModel Status
Model status can be cold and warm refer to how long it takes to launch a machine learning model to accept requests.
Cold Boot: When a model hasn't been used in a while, it gets turned off to conserve resources. This is similar to completely turning off your computer. When you make a request to use the model again, it needs to be fully loaded and started up, which can take several minutes for large models. This is a cold boot.
Warm Boot: If a model has been used recently, it stays loFew readersLogging
Overview
At Segmind, we value your privacy and ensure that the information we log is strictly for operational and billing purposes. This document outlines what data we collect, how we use it, and our policies regarding the storage of your data.
What We Log
Usage and Billing Metrics:
We log basic information to help us calculate usage and billing metrics. This includes:
Time of Inference: The exact date and time each inference is run.
Inference Duration: The timeFew readersAccount and Billing APIs
Credit balance API
To get your credits balance, you can make the following request.
which will return you a response similar to this:Few readersMonitoring
API Usage Dashboard Documentation
This documentation provides an overview of the API Usage Dashboard, designed to help users monitor and manage their API consumption. The dashboard displays information on API usage, helping you keep track of requests and associated costs. It might take upto two hours before the latest data is available on this dashboard.
Overview of the Dashboard
Credits: This shows the total credit baFew readersSegmind Storage
Segmind Storage allows you to upload files as assets and get URLs that can be used with other models. This is particularly useful when you need to provide image inputs to various AI models without repeatedly uploading the same file.
Upload Asset API
Upload files to Segmind Storage and receive a URL that can be used across different models.
Endpoint
Headers
Header
Value
Required
Description
accept
application/jsoFew readersRandom Seed
Across Segmind platform, you can set the seed value to -1 to randomize the value for seed. This would instructs the system to automatically generate and use a random seed each time it runs. The random seed is also returned in the response header for reproducibility. This ensures that the output remains predictably different each time, providing fresh results and avoiding repetitive patterns that may arise from using a fixed seed.
This approach is particularly useful in situations where you needFew readersPremium models
Closed source models and models that are compute heavy are not available to free accounts and require a subscription to access or minimum $5 of credit balance. List of premium models include:
gpt-4
gpt-4o
gpt-4-turbo
gpt-3.5-turbo
llama-v3-70b-instruct
llama-v3-8b-instruct
llama-v3p1-405b-instruct
llama-v3p1-8b-instruct
llama-v3p1-70b-instruct
claude-3.5-sonnet
claude-3-opus
claude-3-haiku
gemini-1.5-flash
gemini-1.5-pro
mixtral-8x22b-instruct
mixtral-8x7b-instruFew readers