StepfunPaid
Step 3.7 Flash
Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...
stepfun/step-3.7-flash
Capabilities
🧠Reasoning👁️Vision🎬Video🔧Tools🧩Structured📜Long context
Specifications
Context window
256K tokens
Input price
$0.20/M
Output price
$1.15/M
Provider
Stepfun
Input modalities
text, image, video
Output modalities
text
Pricing
Pay-per-token
Model ID
stepfun/step-3.7-flash
Strengths
- +Strong step-by-step reasoning
- +Understands images (vision input)
- +Supports tool / function calling
- +Large 256K-token context window
- +Low cost per token
Considerations
- –No notable limitations for general use