API Reference | docs

Rate Limits
Rate limiting serves multiple purposes: Security: It mitigates the risk of abuse or misuse by preventing malicious users from flooding the API with excessive requests, which could disrupt services. Fair Access: Rate limiting ensures equitable access to the API by restricting any single user or application from monopolizing resources, enabling all users to leverage the API effectively. Performance Optimization: By managing the overall load on our infrastructure, rate limiting prevents perfFew readers
Model Status
Model status can be cold and warm refer to how long it takes to launch a machine learning model to accept requests. Cold Boot: When a model hasn't been used in a while, it gets turned off to conserve resources. This is similar to completely turning off your computer. When you make a request to use the model again, it needs to be fully loaded and started up, which can take several minutes for large models. This is a cold boot. Warm Boot: If a model has been used recently, it stays loFew readers
Logging
Overview At Segmind, we value your privacy and ensure that the information we log is strictly for operational and billing purposes. This document outlines what data we collect, how we use it, and our policies regarding the storage of your data. What We Log Usage and Billing Metrics: We log basic information to help us calculate usage and billing metrics. This includes: Time of Inference: The exact date and time each inference is run. Inference Duration: The timeFew readers
Account and Billing APIs
Credit balance API To get your credits balance, you can make the following request. which will return you a response similar to this:Few readers
Monitoring
API Usage Dashboard Documentation This documentation provides an overview of the API Usage Dashboard, designed to help users monitor and manage their API consumption. The dashboard displays information on API usage, helping you keep track of requests and associated costs. It might take upto two hours before the latest data is available on this dashboard. Overview of the Dashboard Credits: This shows the total credit baFew readers
Segmind Storage
Segmind Storage allows you to upload files as assets and get URLs that can be used with other models. This is particularly useful when you need to provide image inputs to various AI models without repeatedly uploading the same file. Upload Asset API Upload files to Segmind Storage and receive a URL that can be used across different models. Endpoint Headers Header Value Required Description accept application/jsoFew readers
Random Seed
Across Segmind platform, you can set the seed value to -1 to randomize the value for seed. This would instructs the system to automatically generate and use a random seed each time it runs. The random seed is also returned in the response header for reproducibility. This ensures that the output remains predictably different each time, providing fresh results and avoiding repetitive patterns that may arise from using a fixed seed. This approach is particularly useful in situations where you needFew readers
Premium models
Closed source models and models that are compute heavy are not available to free accounts and require a subscription to access or minimum $5 of credit balance. List of premium models include: gpt-4 gpt-4o gpt-4-turbo gpt-3.5-turbo llama-v3-70b-instruct llama-v3-8b-instruct llama-v3p1-405b-instruct llama-v3p1-8b-instruct llama-v3p1-70b-instruct claude-3.5-sonnet claude-3-opus claude-3-haiku gemini-1.5-flash gemini-1.5-pro mixtral-8x22b-instruct mixtral-8x7b-instruFew readers