Full-Stack & Multimodal AI Engineer (Text · Voice · Image · Video)
Copied!
Contact Feature
hourly Rate
15.00 USD
1.00 USD
Processing Fees
1.00 USD
What people loved about this seller
Description

I am AI Engineer and
Full-Stack Developer with deep expertise in building intelligent systems across
text, voice, image, and video modalities. Skilled in designing production-grade
AI applications—from chatbots and multimodal assistants to automation pipelines
and computer-vision systems. Strong foundation in LLMs, RAG, fine-tuning, and
scalable web/mobile development, with a proven ability to integrate AI into
real-world products.

 

Skills

AI/ML & NLP

  • LLMs, Chat AI, NLP, text
    generation & summarization
  • RAG, vector databases,
    dataset labeling
  • Fine-tuning, LangChain,
    LangGraph
  • Sentiment analysis,
    classification, NER, topic modeling

Multimodal AI

  • Text-to-Image (DALL·E,
    Midjourney, SDXL)
  • Text-to-Video (Sora, Runway,
    Pika Labs)
  • Image-to-Video (Runway,
    Stable Video Diffusion)
  • Speech-to-Text (Whisper,
    AssemblyAI), Text-to-Speech (GPT-TTS, Murf.ai)
  • Voice cloning, audio
    enhancement, podcast cleanup

Computer Vision

  • Object detection, face
    recognition
  • Medical imaging analysis
  • CCTV analytics, pose
    estimation
  • Robotics perception for
    navigation and manipulation

Software & Full-Stack Development

  • React.js, Next.js
  • Node.js
  • Python
  • Web development, mobile
    development
  • AI integration &
    automation pipelines

 

Business Focus

  • Experience creating
    AI-powered tools that optimize support, reduce costs, and automate
    workflows.
  • Deep understanding of
    integrating AI into business environments such as customer service,
    operations, logistics, marketing, legal, and research.
  • Ability to design systems
    that improve decision-making, streamline processes, and increase ROI
    through intelligent automation.

 

Experience

  • Chatbots (support bots, FAQ,
    internal assistants)
  • Voice agents for call
    centers, assistants, and IVR systems
  • Multimodal agents combining
    text, voice, and images
  • Image generation apps, video
    generation tools, avatar creation, photo editing
  • AI music generation workflows
  • Smart meeting transcribers
    and podcast/audio cleanup systems
  • PDF Q&A and document
    understanding solutions
  • Company knowledge assistants
    and domain-specific RAG systems
  • Search + answer engines and
    research agents
  • Coding agents,
    email/automation agents, web-browsing agents
  • CV systems for robotics,
    drones, warehouses, and surveillance
  • AI monitoring, analytics, and
    model hosting setups

 

Others

  • Strong background in web
    scraping for data pipelines
  • Experience building
    automation systems across cloud and on-prem
  • Capable of designing
    scalable, production-ready AI architectures
  • Cross-platform development
    skills for both web and mobile apps

About the seller
jacob777999

jacob777999

Seller

Not rated yet

From

United Kingdom

Last Seen

3 weeks ago

Member Since

December 11, 2025

Instructions

I am AI Engineer and Full-Stack Developer with deep expertise in building intelligent systems across text, voice, image, and video modalities. Skilled in designing production-grade AI applications—from chatbots and multimodal assistants to automation pipelines and computer-vision systems. Strong foundation in LLMs, RAG, fine-tuning, and scalable web/mobile development, with a proven ability to integrate AI into real-world products.

Skills
AI/ML & NLP
• LLMs, Chat AI, NLP, text generation & summarization
• RAG, vector databases, dataset labeling
• Fine-tuning, LangChain, LangGraph
• Sentiment analysis, classification, NER, topic modeling
Multimodal AI
• Text-to-Image (DALL·E, Midjourney, SDXL)
• Text-to-Video (Sora, Runway, Pika Labs)
• Image-to-Video (Runway, Stable Video Diffusion)
• Speech-to-Text (Whisper, AssemblyAI), Text-to-Speech (GPT-TTS, Murf.ai)
• Voice cloning, audio enhancement, podcast cleanup
Computer Vision
• Object detection, face recognition
• Medical imaging analysis
• CCTV analytics, pose estimation
• Robotics perception for navigation and manipulation
Software & Full-Stack Development
• React.js, Next.js
• Node.js
• Python
• Web development, mobile development
• AI integration & automation pipelines

Business Focus
• Experience creating AI-powered tools that optimize support, reduce costs, and automate workflows.
• Deep understanding of integrating AI into business environments such as customer service, operations, logistics, marketing, legal, and research.
• Ability to design systems that improve decision-making, streamline processes, and increase ROI through intelligent automation.

Experience
• Chatbots (support bots, FAQ, internal assistants)
• Voice agents for call centers, assistants, and IVR systems
• Multimodal agents combining text, voice, and images
• Image generation apps, video generation tools, avatar creation, photo editing
• AI music generation workflows
• Smart meeting transcribers and podcast/audio cleanup systems
• PDF Q&A and document understanding solutions
• Company knowledge assistants and domain-specific RAG systems
• Search +

Booking
Milestones
FAQ
What is your nickname

12345678

Audio
Preview
Map
Additional Details
Order Additional
hourly Rate
15.00 USD
1.00 USD
Processing Fees
1.00 USD
Feedback
This job has no reviews.
hourly Rate
15.00 USD
Processing Fees
1.00 USD
hourly Rate
15.00 USD
1.00 USD
Processing Fees
1.00 USD


  • You may share your affiliate link on websites, forums, social networks, blogs or articles.
  • Anyone who clicks this link will be tagged with your cookie and you will make 10% of whatever they buy on Zeerk.
  • You can even just send friends to the Zeerk home page and get 10% on anything they stumble upon and buy!
  • >> Referral URL Generator

Related Topics

Views: 47 Gig views updated hourly

Other Gigs by jacob777999
Sorry, there are no posted jobs yet.