API Endpoints
Complete reference for all DeepAILab API endpoints. Each endpoint includes parameters, code examples, and response formats.
https://api.deepailab.ai/v1/models/{model_id}/trainTrain Model#
Start a training or fine-tuning job for a model. The training job will run asynchronously and you can track its progress via the experiments endpoint.
Request body
| Parameter | Required | Complete reference for all DeepAILab API endpoints. Each endpoint includes parameters, code examples, and response formats. |
|---|---|---|
dataset_idstring | Required | The ID of the dataset to use for training. |
hyperparametersobject | Optional | Training configuration including epochs, batch_size, learning_rate, etc. |
validation_splitnumber | Optional | Fraction of data to use for validation (default: 0.1). |
dataset_idRequiredThe ID of the dataset to use for training.
hyperparametersOptionalTraining configuration including epochs, batch_size, learning_rate, etc.
validation_splitOptionalFraction of data to use for validation (default: 0.1).
Example request
curl https://api.deepailab.ai/v1/models/model_abc123/train \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $DEEPAILAB_API_KEY" \
-d '{
"dataset_id": "ds_xyz789",
"hyperparameters": {
"epochs": 3,
"batch_size": 32,
"learning_rate": 2e-5
}
}'Response
{
"id": "job_train_99381",
"object": "training.job",
"model_id": "model_abc123",
"status": "pending",
"created_at": 1698765432,
"hyperparameters": {
"epochs": 3,
"batch_size": 32,
"learning_rate": 2e-5
}
}https://api.deepailab.ai/v1/models/{model_id}/predictCreate Prediction#
Run inference on a deployed model. Send input data and receive predictions. The model must be deployed before calling this endpoint.
Request body
| Parameter | Required | Complete reference for all DeepAILab API endpoints. Each endpoint includes parameters, code examples, and response formats. |
|---|---|---|
inputsobject | Required | Input data matching the model's expected schema. |
parametersobject | Optional | Inference parameters like temperature, top_k, etc. |
inputsRequiredInput data matching the model's expected schema.
parametersOptionalInference parameters like temperature, top_k, etc.
Example request
curl https://api.deepailab.ai/v1/models/model_abc123/predict \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $DEEPAILAB_API_KEY" \
-d '{
"inputs": {
"text": "Classify this customer review sentiment."
}
}'Response
{
"id": "pred_12345",
"object": "prediction",
"model_id": "model_abc123",
"predictions": [
{
"label": "positive",
"score": 0.9823
}
],
"usage": {
"input_tokens": 8,
"latency_ms": 45
}
}https://api.deepailab.ai/v1/nas/searchesCreate NAS Search#
Start an automated Neural Architecture Search (NAS) to find the optimal model architecture for your task and constraints.
Request body
| Parameter | Required | Complete reference for all DeepAILab API endpoints. Each endpoint includes parameters, code examples, and response formats. |
|---|---|---|
task_typestring | Required | Type of task: 'classification', 'detection', 'segmentation', etc. |
dataset_idstring | Required | Dataset to use for architecture evaluation. |
constraintsobject | Required | Resource constraints including max_latency_ms, max_params, max_flops. |
search_spacestring | Optional | Predefined search space: 'small', 'medium', 'large' (default: 'medium'). |
task_typeRequiredType of task: 'classification', 'detection', 'segmentation', etc.
dataset_idRequiredDataset to use for architecture evaluation.
constraintsRequiredResource constraints including max_latency_ms, max_params, max_flops.
search_spaceOptionalPredefined search space: 'small', 'medium', 'large' (default: 'medium').
Example request
curl https://api.deepailab.ai/v1/nas/searches \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $DEEPAILAB_API_KEY" \
-d '{
"task_type": "image_classification",
"dataset_id": "ds_imagenet_subset",
"constraints": {
"max_latency_ms": 50,
"max_params": 5000000
}
}'Response
{
"id": "nas_88219",
"object": "nas.search",
"status": "running",
"task_type": "image_classification",
"created_at": 1698765432,
"estimated_duration_hours": 4,
"architectures_evaluated": 0,
"best_architecture": null
}https://api.deepailab.ai/v1/models/{model_id}/deployDeploy Model#
Deploy a trained model to a production endpoint. Once deployed, the model can serve real-time predictions via the predict endpoint.
Request body
| Parameter | Required | Complete reference for all DeepAILab API endpoints. Each endpoint includes parameters, code examples, and response formats. |
|---|---|---|
replicasinteger | Optional | Number of instances to run (default: 1). |
instance_typestring | Optional | Compute instance type: 'cpu', 'gpu-small', 'gpu-large'. |
auto_scalingobject | Optional | Auto-scaling configuration with min_replicas and max_replicas. |
replicasOptionalNumber of instances to run (default: 1).
instance_typeOptionalCompute instance type: 'cpu', 'gpu-small', 'gpu-large'.
auto_scalingOptionalAuto-scaling configuration with min_replicas and max_replicas.
Example request
curl https://api.deepailab.ai/v1/models/model_abc123/deploy \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $DEEPAILAB_API_KEY" \
-d '{
"replicas": 2,
"instance_type": "gpu-small"
}'Response
{
"id": "dep_11235",
"object": "deployment",
"model_id": "model_abc123",
"status": "provisioning",
"endpoint_url": "https://api.deepailab.ai/v1/endpoints/dep_11235",
"replicas": 2,
"instance_type": "gpu-small",
"created_at": 1698765432
}https://api.deepailab.ai/v1/experimentsList Experiments#
Returns a list of experiments (training jobs, evaluations) belonging to the user's organization.
Request body
| Parameter | Required | Complete reference for all DeepAILab API endpoints. Each endpoint includes parameters, code examples, and response formats. |
|---|---|---|
statusstring | Optional | Filter by status: 'pending', 'running', 'completed', 'failed'. |
limitinteger | Optional | Number of experiments to return (default: 20, max: 100). |
afterstring | Optional | Cursor for pagination. Use the id of the last experiment from previous request. |
statusOptionalFilter by status: 'pending', 'running', 'completed', 'failed'.
limitOptionalNumber of experiments to return (default: 20, max: 100).
afterOptionalCursor for pagination. Use the id of the last experiment from previous request.
Example request
curl "https://api.deepailab.ai/v1/experiments?status=running&limit=10" \
-H "Authorization: Bearer $DEEPAILAB_API_KEY"Response
{
"object": "list",
"data": [
{
"id": "exp_123",
"object": "experiment",
"name": "BERT Fine-tuning V2",
"status": "running",
"created_at": 1698765432,
"metrics": {
"loss": 0.45,
"accuracy": 0.89
}
}
],
"has_more": true
}