Filter

My recent searches
Filter by:
Budget
to
to
to
Type
Skills
Languages
    Job State
    1,985 cuda jobs found

    ...augmentation and dataset pipelines comparable to tf.data. • Model Evaluation – training loops, metric tracking, checkpointing and model save/load logic. Runtime expectations The library will be used in standalone fashion; it only needs to expose a public API that any C# solution can reference. Low-level compute may target CPU initially, but the architecture should stay extensible enough to plug in CUDA, DirectML or other accelerators later. Deliverables 1. Source code with clear namespace organisation and XML documentation. 2. A sample project that trains and evaluates at least one CNN on MNIST using only the new library. 3. Step-by-step build instructions and a short design document outlining graph execution, back-propagation implementation and extension p...

    $132 Average bid
    $132 Avg Bid
    47 bids

    ...Python 3.11, FastAPI, SQLAlchemy + Alembic, PostgreSQL, Redis, WebSocket • AI layer – YOLOv8 (people detection / counting), DeepSORT (tracking), RetinaFace + InsightFace-ArcFace (face detection / recognition), FAISS for vector search, OpenCV to pull RTSP streams • Frontend – 15 with TypeScript, Tailwind CSS, Recharts for dashboards • Infrastructure – Docker, docker-compose, NVIDIA CUDA runtime for GPU inference, Nginx reverse proxy Backend reliability and performance sit at the top of the priority list, so I expect an asynchronous FastAPI setup, connection pooling, health checks, graceful shutdown, and clear separation between I/O-bound and GPU-bound tasks. Deliverables should include: 1. Well-documented repo (or mono-repo) layout f...

    $326 Average bid
    $326 Avg Bid
    40 bids

    ...powered by an RTX 5090 and want to turn it into a reliable, local playground for the latest stable releases of Stability Matrix, Stable Diffusion, MidJourney, ComfyUI, and Kohya. I need more than a basic installer—I’d like the full environment properly configured, GPU-accelerated, and ready to create images the moment we finish. You’ll connect through a secure remote session, handle every dependency (CUDA, Python, libraries, checkpoints, etc.), and tune paths so each tool cooperates without conflicts. Where optional models or extensions improve usability, feel free to suggest and add them as long as they remain stable. Deliverables • All five applications installed in their most recent stable versions • Verified ability to generate a sample image...

    $516 Average bid
    $516 Avg Bid
    18 bids

    I am looking for an expert to develop a high-performance, 8-channel automated farming system for the 3D tactical shooter Delta Force. My requirements: Technical Skills & Experience: - Proven experience in real-time computer vision (YOLO, TensorRT) with batch inference and low latency on RTX 4070Ti/4060Ti GPUs. - Strong proficiency in C++ (or high-performance Python with CUDA/C++ backend) and PCIe capture card integration. - Familiarity with KMBOX Net HID-level mouse/touch emulation and anti-cheat evasion (Tencent ACE or similar). - Experience designing centralized dashboards and robust fail-safe management for 24/7 operations. Core Deliverables: - Real-time detection (loot, enemies, extraction points) and UI state recognition via 8x 1080p 60Hz streams. - Visual navigation usi...

    $4099 Average bid
    $4099 Avg Bid
    27 bids

    I need an Ubunt...so I can run machine-learning workloads. The job is purely the installation and configuration of everything required for the guest OS to recognise and fully utilise the GPU (nvidia-smi must work). Key points • VMware host is already installed and stable; you decide whether PCI passthrough, vGPU or another VMware feature makes most sense. • Inside the VM I will need the correct NVIDIA driver, CUDA toolkit and cuDNN ready for TensorFlow or PyTorch later on. • Once complete I should be able to launch a quick test script that confirms the GPU is visible from within Ubuntu. Deliverable A step-by-step session (screenshare or detailed command log) that leaves the VM running and the GPU operational, plus any commands I can rerun if I ever have to r...

    $54 Average bid
    $54 Avg Bid
    14 bids

    ...Video Processing * FFmpeg integration required --- ## File & Project Structure Each project should store: * Audio file * Lyrics * Scene data (JSON) * Generated images * Generated clips * Final output --- ## Advanced Features (Preferred) * Beat-synced cuts and transitions * AI-assisted prompt generation * Style consistency across scenes * Character continuity (optional) * GPU acceleration (CUDA support) --- ## Deliverables * Fully working desktop application * Clean, maintainable Python code * Installation/setup instructions * Ability to run locally without cloud dependency (preferred) --- ## Notes for Developer * This is not a simple generator — it is a **production tool** * User control over scenes is critical * Performance and stability are important * M...

    $386 Average bid
    $386 Avg Bid
    92 bids

    ...Ubuntu and I need the open-source WAN Video 2.7 stack fully installed and running on it. You will connect over AnyDesk, handle the complete setup, pull all required libraries and system dependencies, and verify the application launches cleanly with GPU acceleration enabled. During the session I will stay online to provide root access and restart the box if needed. Please be comfortable working in a CUDA-centric environment, compiling from source when binaries are not available, and troubleshooting driver or codec conflicts that sometimes appear on DGX hardware. Acceptance criteria • WAN Video Open Source 2.7 starts without errors and streams a test feed. • All supporting packages and services are documented in a short README so I can replicate the build later. &...

    $31 Average bid
    $31 Avg Bid
    15 bids

    ...Long-term work possible if successful Scope of Work: - Diagnose deep learning pipeline issues - Fix model execution errors - Debug training / inference workflow - Resolve dependency or environment conflicts - Optimize pipeline stability - Ensure end-to-end execution works correctly - Provide brief documentation of fixes Technical Stack: - Python - PyTorch / TensorFlow - HuggingFace / Transformers - CUDA / GPU acceleration - Docker / Linux environment - API integration & Data preprocessing pipeline Requirements: - Strong experience in Deep Learning production workflows - Experience debugging complex AI pipelines - Comfortable working under urgent timelines and ability to start immediately Timeline: Start: Immediately. Expected turnaround: 24–48 hours. Proposal Requi...

    $33 / hr Average bid
    $33 / hr Avg Bid
    89 bids

    I’m building a camera-based ... log every reading with a timestamp, and trigger a visual or audible alert whenever negative emotions are detected repeatedly within a short window. A lightweight dashboard served with either Streamlit or Flask will let me: • watch the annotated video feed • view rolling emotion statistics and charts • review and download the timestamped log of events and alerts Optimisation for Jetson (CUDA, cuDNN, TensorRT where appropriate) is essential, and the finished app should launch from a single command, open the dashboard in a browser, sustain real-time performance, and shut down cleanly. Please keep the code modular and well commented so I can retrain or swap models later and, if convenient, provide a Dockerfile or setup script ...

    $556 Average bid
    $556 Avg Bid
    15 bids

    ...personalized address generator written in C/C++ and CUDA. The tool should have the following features: • Read any number of prefix-suffix patterns from the `` file (example format: `Taaa*1111` or `Tbbb*222`). • Launch a GPU kernel to continuously generate wallet addresses and compare each address with all patterns. If a match is found, write the matching address and its private key to disk. • Fully utilize GPU performance, achieving the same speed as my current test version (approximately 8 billion addresses per second). Please display a "addresses per second" counter in real-time during program execution. • Generate a plain text log file recording key events: startup time, device information, running hash rate snapshots, and each match found. ...

    $530 Average bid
    $530 Avg Bid
    31 bids

    ... "Zero-Shot" Virtual Try-On pipeline into an existing Flutter/Python e-commerce stack. Technical Stack Requirements AI/ML: Experience with IDM-VTON, Cat-VTON, or OOTDiffusion. Mastery of Stable Diffusion (ControlNet/IP-Adapter) is mandatory. Computer Vision: Expertise in MediaPipe or OpenPose (pose estimation) and DensePose (surface mapping). Backend: Python (FastAPI/PyTorch), gRPC/REST, and CUDA optimization. Frontend Integration: Flutter (Dart) for image handling and state management. Key Deliverables The "Zero-Retrain" Pipeline: A model that accepts a flat garment image and a user photo to produce a drape-accurate result without per-SKU training. Latency Optimization: Implementation of TensorRT or AITemplate to bring inference time under 3 seconds on ...

    $2048 Average bid
    $2048 Avg Bid
    42 bids

    ...post_content string. No Raw HTML: Mapping must use native Divi 5 module settings (Colors, Padding, Fonts, Flexbox) to ensure the layout is fully editable. Technical Stack Language: Python (FastAPI/Flask for backend, PyQt or Streamlit for local UI). Browser Automation: Playwright or Selenium (Stealth mode). OS: Windows 11. Optimization: Must be able to handle local inference calls via RTX 5090 (CUDA). Budget & Milestones ($1000 Total) Milestone 1 ($200): Functional Site Crawler (URL Listing & Selection). Milestone 2 ($400): Core Conversion Engine (Successfully importing a complex Section into Divi 5 at 100% progress). Milestone 3 ($400): Full UI Implementation, Section Slicing, and Local API Integration. Note to Freelancers: I will provide a Reference JSON file ...

    $1156 Average bid
    $1156 Avg Bid
    238 bids

    ...from a watch-list I will provide. Because the cameras operate 24/7 in very mixed environments—low-light corridors, exposed outdoor zones that face rain or glare, and busy high-traffic entry points—the model must remain accurate under those conditions. Solutions that leverage YOLO, TensorFlow, PyTorch, OpenCV or comparable frameworks are fine as long as they run on my existing Nvidia GPU server (CUDA-enabled). Deliverables 1. Trained model files plus any custom scripts. 2. A lightweight API or service (Python preferred) that ingests RTSP streams, performs detection, and triggers my existing alerting webhook. 3. Setup instructions and a brief validation report showing performance in the three stated conditions (night-time, outdoor weather, high traffic). I ...

    $136 Average bid
    $136 Avg Bid
    52 bids

    ... Here is what I need delivered: • High-quality masks for every image, respecting a class list that includes typical road-scene elements (road, sidewalk, vehicles, sky, vegetation, building façades, pedestrians) plus key indoor objects you would expect in a café setting (tables, chairs, walls, floor, counter). • A training pipeline in PyTorch or TensorFlow that I can run on Ubuntu 22.04 with CUDA, along with a clear README covering dataset preparation, training, and inference. • A model that reaches at least 0.75 mIoU on a private test split I will share once the annotations are complete. You are free to use tools such as CVAT, LabelMe, Detectron2, DeepLabV3+, SegFormer—or any comparable framework—as long as the final workflow remain...

    $121 Average bid
    $121 Avg Bid
    10 bids

    ...into the transcript with millisecond accuracy. Both real-time feedback (small overlay suggestions) and post-video analytics (downloadable PDF/CSV plus on-screen dashboard) are needed. I’m happy for you to build with tools such as OpenCV, MediaPipe, TensorFlow, PyTorch, spaCy or similar—use what you are fastest with as long as the models run efficiently in a web environment (GPU acceleration via CUDA or WebGL is a plus). Deliverables 1. Source-controlled codebase ready to deploy on a standard cloud stack (Docker image or Heroku-style procfile). 2. Front-end UI (React, Vue or vanilla JS) that lets users toggle between real-time and upload modes. 3. Modular inference services for vision and audio that can be retrained or swapped if I add new metrics later. 4. C...

    $1263 Average bid
    $1263 Avg Bid
    37 bids

    ...data and route only the most promising parameter sets back to the gate model. Latencies must stay sub-millisecond from signal to order, so a coherent design for GPU–FPGA–QPU orchestration is essential. Deliverables • A documented architecture diagram showing data flow between classical AI, middleware, and the chosen quantum SDK (Qiskit, Braket or similar). • Clean, modular Python code with C++/CUDA kernels where latency demands it, fully containerised for reproducibility. • Back-test and forward-test reports on at least one major FX pair and a US equity futures contract, including Sharpe, max drawdown, and execution slippage statistics. • Deployment guide for a colocation environment, covering queue management to the quantum back-end and f...

    $221 Average bid
    $221 Avg Bid
    14 bids

    ...job is to create the complete vision-detection module—from model training or fine-tuning through to a clean ROS 2 node that subscribes to an image topic and spits out the detected objects with bounding boxes (or masks) and a confidence score. OpenCV, TensorFlow/PyTorch and any of the common ROS 2 image-transport plugins are all fine as long as the final node runs on Humble and stays GPU-agnostic (CUDA acceleration is a bonus, not a requirement). I already have a test rig with a standard USB camera; if you need specific calibration images I can capture them for you. Please deliver: • Source code for the detection model and ROS 2 node • A launch file that brings everything up with default parameters • A brief README explaining setup, parameters and expect...

    $146 Average bid
    $146 Avg Bid
    41 bids

    ...environment that emulates Jet Nano hardware for research and development on machine-learning models. The goal is to give my team a sandbox where we can move seamlessly from data preprocessing and feature extraction through model training, evaluation, deployment, and monitoring—without touching the physical board until we are ready. Here’s what I need: • A reproducible simulation that mirrors Jet Nano’s CUDA-enabled GPU, memory constraints, and I/O. • Containerised tool-chain (PyTorch, TensorRT, cuDNN, etc.) with scripts that cover the full life-cycle: preprocessing, training, hyper-parameter sweeps, evaluation metrics, and a mock-deployment stage that tracks resource usage and latency. • Clear documentation so any teammate can spin up the en...

    $2205 Average bid
    $2205 Avg Bid
    63 bids

    ... • Accepts at least JPEG files for input; adding PNG or BMP later should remain possible. • Generates a short video (MP4 preferred) by feeding the image through Stable Diffusion and WAN2.6. • Interface must feel intuitive for non-technical users while exposing advanced settings in an “expert” panel. • Conversion speed is critical; please optimise GPU utilisation and let me choose device (CUDA / DirectML). • Output parameters—resolution, frame rate, length, prompt text, CFG scale, seed—should all be editable before rendering. Deliverables 1. Executable installer (or portable folder) with all weights and dependencies bundled for offline use. 2. Source code with clear build instructions so I can re-compile if models up...

    $106 Average bid
    $106 Avg Bid
    38 bids

    Backend for a my app using FastAPI, WebSock...and concurrency: Comfortable designing and debugging async workflows. Hands‑on AI integration experience with at least one of: Whisper STT or other speech‑to‑text engines. LLaMA/transformer‑based LLMs or OpenAI‑style APIs. TTS systems such as Coqui, Kokoro, or Piper. Realtime systems: WebSockets, WebRTC, or other low‑latency streaming architectures. Nice to Have GPU & deployment experience: CUDA, GPU environments, and performance tuning (CPU vs GPU). Docker, nginx, PM2, and production deployment pipelines. Background processing: Job queues/workers for heavy audio/video processing. Experience orchestrating long‑running media/AI tasks. Video processing tools: FFmpeg, Wav2Lip, or similar for video generation and post‑processing.

    $20 / hr Average bid
    $20 / hr Avg Bid
    193 bids

    ...Predict response to therapy (Responder / Non-responder) Predict survival category Predict recurrence risk For MVP: Start with diagnosis, then add treatment prediction. STEP 2: Setup Development Environment Install Dependencies Python 3.9+ PyTorch MONAI pydicom numpy scikit-learn FastAPI or Flask Example: pip install monai torch torchvision pydicom fastapi uvicorn scikit-learn Setup GPU Local CUDA GPU OR Cloud (AWS/GCP/Azure) STEP 3: PET Scan Dataset Preparation Collect Dataset Public PET database (e.g., TCIA) Research partnership dataset Must include: PET images Diagnosis labels (Optional) treatment outcome labels Organize Data Structure: data/ train/ val/ test/ Handle DICOM Files Use pydicom to read images Convert to 3D tensors Normalize voxel intensity STEP 4:...

    $656 Average bid
    $656 Avg Bid
    54 bids

    ...training worker (Docker, from scratch) - PHP/MySQL licensing backend + Stripe webhook integration - Unified cross-platform installer (detects DAWs, installs everything in one pass) - GitHub Actions CI/CD (Windows + macOS builds) - Full Apple + Windows code signing pipeline - Documentation (User Guide + Developer Guide + BYOK Setup) Key technical requirements: - CPU default with automatic NVIDIA CUDA detection for Live Mode - RMVPE primary pitch extraction + user toggle (Harvest/Crepe/FCPE) - High-quality resampling (44.1k-96k) in C++ wrapper - AI Cleaning (de-reverb/isolation) in front of inference chain - Index Rate + .index file exposed in UI/API - Batch processing via ZMQ socket bridge Terms agreed: - Budget: $2,500 (6 milestones) - Timeline: 6 weeks (Feb 23 - Apr 3, 2026) -...

    $2500 Average bid
    $2500 Avg Bid
    1 bids

    ...similar) - Weasyprint or ReportLab for PDF - Typer CLI with subcommands: - transcribe - diarize - lesson-report - aggregate - YAML config file - Logging, progress bars, caching (skip if output exists), error handling Deliverables: - Full repo structure - All source code (src/ layout, CLI, config, prompts, PDF renderer) - Installation instructions for Windows 11 (Python, ffmpeg, Poetry, CUDA) - Example commands - Test guide with sample audio Please show experience with WhisperX / faster-whisper, Pyannote, Ollama, and Weasyprint on Windows + GPU setups in your proposal. Thank you! Vladimir...

    $1089 Average bid
    $1089 Avg Bid
    78 bids

    ...rapid target motion • Adapt to scale and orientation changes • Maintain lock under partial occlusion • Recover gracefully if tracking confidence drops • Avoid drift over time A re-detection or hybrid tracking strategy is preferred if it improves robustness. Technical Requirements Preferred stack: • Python + OpenCV OR C++ + OpenCV • Modular architecture • Hardware acceleration support (CUDA / TensorRT) is a strong plus • Experience with: • Siamese-based trackers • DeepSORT-like approaches • Hybrid detection + tracking pipelines Clean, well-documented code is mandatory. Deliverables 1. Fully functional Linux application 2. Source code repository 3. Setup instructions + dependency list 4. Short demo video...

    $2569 Average bid
    NDA
    $2569 Avg Bid
    31 bids

    ...website. I have the hardware available but need an expert who can install the model, configure all dependencies, and expose an endpoint that my front-end widget can call. Here is what I have in mind: • Select and download an open-weight GPT-like model that can reasonably run on local hardware (e.g., Llama-2, Mistral, or another suitable alternative). • Set up the execution environment—Python, CUDA, PyTorch or TensorFlow—plus any supporting libraries (LangChain, FastAPI, uvicorn, etc.). • Create or refine an inference script that keeps response times low enough for smooth chat. • Build a lightweight API (REST or WebSocket) so the website can pass the user’s prompt and receive the model’s reply. • Hand me clear, repeatable...

    $212 Average bid
    $212 Avg Bid
    53 bids

    ...expectations • Real-time theft detection logic that raises an event or REST webhook the moment a suspicious removal is spotted • On-screen bounding boxes and confidence scores for detected grocery items and customers • Continuous customer counter with hourly CSV/JSON export • Installers or scripts for Windows 10/11 and Raspberry Pi OS, including all required Python, OpenCV, PyTorch/ONNX, CUDA (where available) dependencies • A simple dashboard that shows live feed thumbnails, current customer count, and the last N theft alerts • Clear instructions on adding new grocery SKUs later Acceptance will be based on: 1. Smooth 25-30 fps inference on 1080p streams under Windows with GPU, and ≥10 fps on Raspberry Pi using CPU or a USB accele...

    $729 Average bid
    $729 Avg Bid
    124 bids

    ...the hardware allocated and wallets ready; what I need is an engineer who can take the nodes from zero to profitable operation and then keep them humming. Key tasks • Provision and secure each H100 instance, configure networking, firewalls, SSH keys and wallets • Containerise the stacks with Docker (Kubernetes or Podman are possible later, but Docker is fine for the first iteration) • Tune CUDA-level settings so every GPU cycle counts and rewards are maximised • Build simple Bash or Python scripts that monitor logs, restart on failure and push basic alerts • Produce step-by-step documentation so the setup can be replicated or audited at any time Acceptance criteria • Nodes reach consensus, stay above 99 % uptime and begin generating rewa...

    $509 Average bid
    $509 Avg Bid
    52 bids

    Lead AI / Fullstack Engineer — ...communication. ​Traffic Localization: Optimize routing protocols to maximize performance within the TAS-IX network. ​Candidate Requirements ​AI / ML Engineering: ​Proven experience with End-to-end (E2E) speech models (Moshi, AudioLM, or similar). ​Deep proficiency in PyTorch and Transformer architectures. ​Hands-on experience in Fine-tuning LLMs/S2S models for new language groups. ​Expertise in CUDA 12.x and NVIDIA optimization libraries. ​Fullstack Development: ​Expert-level knowledge of WebRTC / WebSockets for real-time media streaming. ​Demonstrated experience in developing Telegram Mini Apps (TMA). ​Professional mastery of FastAPI and React / Next.js. ​Strong understanding of the constraints and requirements of Low-latency systems.

    $1221 Average bid
    $1221 Avg Bid
    62 bids

    This project requires real GPU computation, correct Bitcoin cryptography handling, and verifiable results. This is not a demo or theoretical project. The program must be fully functional and tested. Only apply if you have proven experience with CUDA, cryptography, or Bitcoin key handling.

    $115 Average bid
    $115 Avg Bid
    1 bids

    ...modern GPUs and expose a clean, future-proof API for downstream applications. My end goal is to abstract away vendor-specific quirks so a data-scientist, graphics engineer, or researcher can tap into raw parallel power without worrying about whether the machine is running Windows, Linux, or macOS, or whether it ships with NVIDIA, AMD, or Intel silicon. You’re free to recommend the optimal blend of CUDA, ROCm, OpenCL, Vulkan, or even a custom compute layer—what matters is performance, portability, and clean code that’s easy to extend. I’m open to focusing on a single workload first (machine-learning kernels, real-time graphics effects, or heavy scientific simulations) if that helps us validate the core, then scaling outward. Deliverables I’m exp...

    $192 Average bid
    $192 Avg Bid
    37 bids

    Job Title: CUDA Developer Needed – GPU-Accelerated Bitcoin WIF Key Recovery Tool (Verification Required) Project Description: I am looking for an experienced CUDA / GPU developer to build and optimize a high-performance Bitcoin WIF private key recovery program. This project requires real GPU computation, correct Bitcoin cryptography handling, and verifiable results. This is not a demo or theoretical project. The program must be fully functional and tested. Only apply if you have proven experience with CUDA, cryptography, or Bitcoin key handling. Technical Requirements: - Written in C++ with CUDA - Runs on NVIDIA GPUs - Command-line interface (CLI) - Supports Bitcoin WIF (Base58Check) - Supports compressed and uncompressed private keys - Correct che...

    $171 Average bid
    $171 Avg Bid
    34 bids

    Lead AI / Fullstack Engineer — ...communication. ​Traffic Localization: Optimize routing protocols to maximize performance within the TAS-IX network. ​Candidate Requirements ​AI / ML Engineering: ​Proven experience with End-to-end (E2E) speech models (Moshi, AudioLM, or similar). ​Deep proficiency in PyTorch and Transformer architectures. ​Hands-on experience in Fine-tuning LLMs/S2S models for new language groups. ​Expertise in CUDA 12.x and NVIDIA optimization libraries. ​Fullstack Development: ​Expert-level knowledge of WebRTC / WebSockets for real-time media streaming. ​Demonstrated experience in developing Telegram Mini Apps (TMA). ​Professional mastery of FastAPI and React / Next.js. ​Strong understanding of the constraints and requirements of Low-latency systems.

    $4030 Average bid
    $4030 Avg Bid
    76 bids

    manual intervention. 3. Re-assemble processed frames back into a single clip using FFmpeg (or similar), ensuring temporal consistency—no flicker or dropped frames. 4. Expose a simple CLI command such as: python --input --output --strength 0.7 --seed 42 5. Provide a short README covering environment setup (Python, diffusers / transformers versions, CUDA requirements), example usage, and expected runtimes Acceptance criteria • The script completes a sample without errors and produces visibly live-action styling throughout. • Code is clean, commented, and includes a or environment.yml. Delivery: source code, README, and one converted sample clip produced by your wrapper.

    $57 Average bid
    $57 Avg Bid
    63 bids

    ...short written walkthrough covering hardware requirements, model parameters, and tips for further tuning. Acceptance criteria 1. Frame-by-frame identity preservation ≥ 95 % (verified with face-recognition scores). 2. No temporal flicker visible on 30-fps playback. 3. End-to-end generation time under 2× video length on a single high-end GPU. Tech stack keywords: PyTorch, TensorFlow, FFmpeg, CUDA, Google Colab, facial-landmark detection, GAN inversion. Roadmap beyond this delivery Once the core system is proven, I plan to expand into other AI-driven video features—scene synthesis, automated dubbing, even real-time object tracking—so clean, well-documented code is essential for future extension. Ready to start as soon as we agree on the approach, and ...

    $18 / hr Average bid
    $18 / hr Avg Bid
    33 bids

    ...- Auto-sync narration and visuals - Options: - voice selection (male/female) - narration speed - background music (optional) - subtitles (optional) Tech Requirements: - Must support OFFLINE mode (local machine) using open-source models (preferred) - Should also support ONLINE mode (server/cloud deployment) - Efficient pipeline (render without crashing) - Works on CPU + GPU if available (CUDA GPU preferred) Preferred Implementation (engineer decides exact tools): - Python backend (FastAPI preferred) - Local model inference pipeline - Video assembly using FFmpeg / MoviePy - Open-source TTS narration (example: XTTS, Piper, Coqui TTS, etc.) - Open-source image generation or whiteboard assets pipeline - LLM for storyboard/script breakdown (open-source model OR cheapest API) ...

    $235 Average bid
    $235 Avg Bid
    19 bids

    ...advise on the best hardware stack to achieve the 15-50 person real-time requirement: Cameras: Recommend specific sensors or cameras (e.g., Wide FOV, Global Shutter, or IR-capable for low-light/glare robustness). Processing Units: Advice on edge deployment. Can this run on a Raspberry Pi 5 with an AI Kit, or is an NVIDIA Jetson (Orin/Nano) required? If commodity GPUs are needed, specify minimum VRAM/Cuda core requirements. Kits: Recommend specific "plug-and-play" kits or enclosures suitable for the indoor/outdoor environment described. Final Deliverables Hardware Recommendation Report: Detailed list of suggested cameras, lenses, and processing kits (Raspberry Pi, Jetson, etc.) tailored to this specific use case. Source Code: Ready to plug into a Python environment ...

    $565 Average bid
    $565 Avg Bid
    62 bids

    I’m running ...trade-offs you introduce so I can reproduce and benchmark I already run basic line-profiler and torch-autograd checks, so I’m looking for deeper insights—vectorised ops, smarter batching, async data movement, or architectural tweaks I may have missed. Feel free to use tools like PyTorch Profiler, nvprof, or your preferred optimisers as long as the final instructions remain reproducible in a standard CUDA environment. If that sounds straightforward, let me know your availability and how you’d approach the first pass; I’m ready to share the repo right away.

    $14 / hr Average bid
    $14 / hr Avg Bid
    34 bids

    ..., Twilio or Meta API) OR a custom Mobile App (Flutter/React Native) for security staff. Dashboard: A simple web-based or local interface to view live logs, replay detected incidents, and manage sensitivity settings. Technical Requirements: Programming Language: Python. Frameworks: PyTorch, TensorFlow, OpenCV, YOLO (v8/v10), or MediaPipe. Hardware Compatibility: Must be optimized for NVIDIA CUDA cores / TensorRT. Scalability: The code should support multiple camera streams simultaneously. Deliverables: Full Source Code (well-documented). Setup Guide (How to install on the NVIDIA device and connect cameras). A working prototype/MVP demonstrating the detection of basic theft actions. Ideal Candidate: Proven experience in Computer Vision and Action Recognition. Previous ...

    $579 Average bid
    $579 Avg Bid
    98 bids

    I need a Windows-based GPU workstation dedicated to running local large-language-model workflows. I need someone who can walk me through the full setup—hardware , CUDA drivers, PyTorch/TensorFlow installs, plus the extra tools I rely on for text-to-video generation and similar AI workloads. Your first task is to get the machine fully operational: verify BIOS and power settings, install the latest GPU driver stack, configure CUDA/cuDNN, and deploy the core frameworks. From there we’ll layer in local-LLM utilities (e.g., , Ollama) alongside Stable Diffusion or any other video-generation packages I might explore. Clear, repeatable documentation of every step is essential so I can reproduce the environment later. Once the base system is stable, I’d like on...

    $22 / hr Average bid
    $22 / hr Avg Bid
    90 bids

    ...3), OCR (paddleocr 2.10.0 on paddlepaddle 3.0.0 / paddlepaddle-gpu 2.6.2), and post-processing with scikit-learn 1.6.0. Although one GPU-ready wheel is present, all processing still executes on the CPU. The goal is full NVIDIA CUDA utilisation across the entire workflow, from frame decoding to final inference. I need you to: • Profile the current code, pinpoint CPU-bound sections, and migrate or rewrite them for GPU execution (CUDA, CuDNN, cuBLAS, or other relevant CUDA-based APIs). • Update or swap libraries where necessary—feel free to recommend faster CUDA-compatible alternatives if they will not break accuracy (e.g., CuPy, TensorRT, NVIDIA Video Codec SDK). • Modify the code so GUI-less batch processing and real-time video run...

    $99 Average bid
    $99 Avg Bid
    60 bids

    ...spots a potential anomaly. All processing must happen in real time without introducing perceptible latency to the surgeon’s view. My current hardware outputs standard HDMI and records to DICOM, so your code should sit either between the camera head and the display (FPGA, GPU box, or high-performance PC is fine) or run as a software module on the workstation already attached to the scope. OpenCV, CUDA, TensorFlow, or similarly robust libraries are welcome—just keep licensing constraints clear. Deliverables • Executable or deployable source that enhances image clarity, performs real-time analysis, and triggers automated anomaly detection. • API or integration hooks so I can feed the processed stream back to my recording software. • A concise user gu...

    $299 Average bid
    $299 Avg Bid
    11 bids

    ...machine freeze during model training? Welcome to your new digital superpower. I bridge the gap between your ideas and the raw power of Microsoft Azure. I don’t just "rent servers"—I architect secure, high-performance environments so you can focus on building the future. What’s in my secret sauce? GPU Beasts: Access NVIDIA N-Series (V100, A10, T4) for AI/ML. Ready-to-Go Stack: I’ll pre-install CUDA, PyTorch, TensorFlow, or Docker. No more driver headaches! Fort Knox Security: Advanced Firewalls & private VPNs. Your VM stays invisible to the public web. Windows or Linux? I speak both. Whether you need an RDP or an SSH terminal, I’ve got you. The "Date Before You Marry" Trial Not sure if the speed is right? For just $5, ...

    $20 / hr Average bid
    $20 / hr Avg Bid
    36 bids

    ...with a brand-new RTX 5090 and need TensorRT installed, tuned and ready to accelerate Stream Diffusion inside TouchDesigner. I haven’t settled on a specific release yet, so I’ll rely on your guidance to pick the most stable, future-proof version (including matching CUDA and cuDNN builds) for this GPU. Here’s what I expect: • Recommend the best TensorRT version for an RTX 5090 Windows environment and explain why it’s the right fit. • Handle the full installation: download packages, configure environment variables, and verify driver / CUDA compatibility. • Prove the install works by running a sample inference, then confirm TouchDesigner can see the TensorRT engine for Stream Diffusion. • Leave me with a concise, step-by-step recap ...

    $74 Average bid
    $74 Avg Bid
    11 bids

    ...a brand-new RTX 5090 and need TensorRT installed, tuned and ready to accelerate Stream Diffusion inside TouchDesigner. I haven’t settled on a specific release yet, so I’ll rely on your guidance to pick the most stable, future-proof version (including matching CUDA and cuDNN builds) for this GPU. Here’s what I expect: • Recommend the best TensorRT version for an RTX 5090 Windows environment and explain why it’s the right fit. • Handle the full installation: download packages, configure environment variables, and verify driver / CUDA compatibility. • Prove the install works by running a sample inference, then confirm TouchDesigner can see the TensorRT engine for Stream Diffusion. • Leave me with a concise, step-by-step rec...

    $161 Average bid
    $161 Avg Bid
    20 bids

    Looking for developer who can work on below requirment . Lead design and implementation of GPU computers for deep learning; optimize algorithms and mentor team Must have key skills cuda,c++,Gpu Programming Other key skills Parallel Computing,Opengl,Opencl Job description What you’ll do CUDA is a must JD For Senior / Lead Engineer (HPC GPU):- As a Senior / Team Lead (HPC) you will provide leadership in designing and implementing groundbreaking GPU computers that run demanding deep learning, high-performance computing, and computationally intensive workloads. We seek an expert to identify architectural changes and/or completely new approaches for accelerating our deep learning models. As an expert, you will help us with the strategic challenges we encounter, includi...

    $1205 Average bid
    $1205 Avg Bid
    72 bids

    ...virtually no perceptible delay. The tool must lock on to faces accurately, track expressions, match lighting and color, and render the composite at a stable frame rate suitable for streaming or studio recording. The core pipeline should include high-resolution face detection, landmark tracking, real-time inference with a modern GAN or transformer model, and seamless blending. Feel free to lean on CUDA-accelerated TensorFlow or PyTorch, OpenCV for image I/O, and any efficient post-processing libraries you trust—what matters is rock-solid performance and visual fidelity. I want the interface to be simple: a preview window, a slot to load or capture the target face, quick toggles to enable/disable tracking, and an option to record or pipe the output to a virtual camera devic...

    $239 Average bid
    $239 Avg Bid
    11 bids

    ...deliver must install and operate smoothly in that environment without the usual Linux-only work-arounds that most Jetson guides assume. Here is what I need from you: build a lightweight, fully-functional miner that recognizes the Jetson Nano’s CUDA-capable GPU, connects to any standard Bitcoin pool I specify, and begins hashing immediately after a one-time setup wizard. The setup flow should auto-detect the board’s hardware, prompt for the pool URL, wallet address, and worker name, then save those settings for future boots. Key technical expectations • CUDA acceleration out of the box—no manual library hunting. • Clean, single-click installer for Windows 11 on ARM. • Real-time dashboard showing hash rate, accepted/rejected shares, p...

    $503 Average bid
    $503 Avg Bid
    42 bids

    I have a project that should work with ComfyUI / WAN2 set-up and now need to turn it into approximately 15–20 minutes of finished, classroom-ready video. We have text...• Final MP4s play without glitches on standard players • Everything is handed over within the agreed, ASAP timeline • ComfyUI API We have GPU Server ready with following config Server Configuration Intel Dual XEON E5-2697v4 CPU Cores: 18 RAM: 256GB DDR4 GPU: 3 x Nvidia Quadro RTX A5000 (3GPU) STORAGE: 240GB SSD (Boot) + 2TB NVMe + 8TB SATA (10TB) GPU Specifications Microarchitecture: Ampere CUDA Cores: 10,752 Tensor Cores: 336 GPU Memory: 24GB GDDR6 FP32 Performance: 38.71 TFLOPS If you already work with ComfyUI or similar AI video pipelines and can hit these language requirements quickly, l...

    $230 Average bid
    $230 Avg Bid
    7 bids

    ...training a convolutional neural network and now I want it running reliably on an AWS EC2 instance. I already have an AWS account and am settled on using EC2 rather than SageMaker or Lambda, so the task is purely about standing up the production environment and proving that the model answers live requests. Here’s what I need: • Spin up and configure an EC2 instance (Ubuntu preferred) with GPU drivers, CUDA / cuDNN, Python, and either TensorFlow or PyTorch—whichever my model requires. • Package the model (saved .h5 or .pt plus any preprocessing code) into a lightweight service—Flask, FastAPI, or another simple REST interface is fine. • Expose a secure HTTPS endpoint behind an AWS load balancer or an Nginx reverse proxy so I can hit /predict wi...

    $14 Average bid
    $14 Avg Bid
    4 bids

    Top cuda Community Articles