databricks

JSON twin: https://www.healthaidb.com/software/databricks.json

Company Name

Databricks

Product URL

https://www.databricks.com/product/data-intelligence-platform

Company URL

https://www.databricks.com/

Categories

Summary

Databricks provides a unified data intelligence Lakehouse platform used by healthcare and life sciences organizations to unify data, run analytics, and build/deploy AI and ML at scale.

Description

A cloud-native data and AI platform (Lakehouse) that unifies structured and unstructured healthcare data, supports ETL/streaming, governance (Unity Catalog), Delta Lake transactionality, solution accelerators for HL7/FHIR and biomedical LLMs, marketplace data sharing (Delta Sharing) and tools for data engineering, data science, ML/GenAI and analytics across cloud providers (AWS, Azure, GCP).

Api Available

yes

Certifications

Company Founding

2013

Company Offices

Compliance

Customers

Data Residency

Region selection by cloud provider (customer chooses AWS/Azure/GCP region / BYO cloud region)

Data Standards

Deployment Model

Features

Id

P0447

Integration Partners

Integrations

Languages Supported

Last Updated

2025-09-07

License

commercial / proprietary

Links

Market Segment

Optional Modules

Os Platforms

Pricing Details

Tiered enterprise pricing based on cloud consumption (DBUs) and compute/storage; free trial/Free Edition for learning; contact vendor for quotes and enterprise plans.

Pricing Model

enterprise_quote

Privacy Features

Ratings

Regions Available

Release Year

2013

Security Features

Specialties

Support Channels

System Requirements

Target Users

Training Options

Type

product

User Reviews

Version

1.0

Canonical JSON

{
  "company_name": "Databricks",
  "company_url": "https://www.databricks.com/",
  "company_offices": [
    "United States",
    "United Kingdom",
    "Brazil",
    "Costa Rica",
    "Spain",
    "Saudi Arabia",
    "Denmark",
    "Serbia",
    "Canada",
    "India"
  ],
  "company_founding": "2013",
  "product_url": "https://www.databricks.com/product/data-intelligence-platform",
  "categories": [
    "data engineering",
    "data science",
    "artificial intelligence",
    "business intelligence",
    "data management",
    "data warehousing",
    "application development",
    "healthcare & life sciences"
  ],
  "market_segment": [
    "enterprise",
    "smb"
  ],
  "links": [
    "https://www.databricks.com/",
    "https://www.databricks.com/product/data-intelligence-platform",
    "https://www.databricks.com/solutions/industries/healthcare-and-life-sciences",
    "https://docs.databricks.com/en/index.html",
    "https://www.databricks.com/trust",
    "https://www.databricks.com/product/pricing",
    "https://www.databricks.com/company/careers",
    "https://www.g2.com/products/databricks-data-intelligence-platform/reviews",
    "https://www.databricks.com/company/about-us",
    "https://www.databricks.com/customers"
  ],
  "summary": "Databricks provides a unified data intelligence Lakehouse platform used by healthcare and life sciences organizations to unify data, run analytics, and build/deploy AI and ML at scale.",
  "description": "A cloud-native data and AI platform (Lakehouse) that unifies structured and unstructured healthcare data, supports ETL/streaming, governance (Unity Catalog), Delta Lake transactionality, solution accelerators for HL7/FHIR and biomedical LLMs, marketplace data sharing (Delta Sharing) and tools for data engineering, data science, ML/GenAI and analytics across cloud providers (AWS, Azure, GCP).",
  "target_users": [
    "data scientists",
    "data engineers",
    "clinical researchers",
    "biostatisticians",
    "healthcare analysts",
    "IT / cloud administrators",
    "R&D teams (pharma/biotech)",
    "BI analysts",
    "product managers",
    "executives"
  ],
  "specialties": [
    "clinical analytics",
    "real-world evidence / RWE",
    "pharmaceutical R&D",
    "genomics / bioinformatics",
    "population health",
    "medical imaging analytics",
    "operational/financial analytics",
    "clinical trial analytics",
    "supply chain forecasting",
    "biomedical literature retrieval"
  ],
  "regions_available": [
    "United States",
    "Canada",
    "United Kingdom",
    "European Union",
    "Australia",
    "India",
    "Japan",
    "Singapore",
    "Brazil",
    "Mexico"
  ],
  "languages_supported": [
    "English",
    "Spanish",
    "French",
    "German",
    "Japanese",
    "Chinese (Simplified)"
  ],
  "pricing_model": "enterprise_quote",
  "pricing_details": "Tiered enterprise pricing based on cloud consumption (DBUs) and compute/storage; free trial/Free Edition for learning; contact vendor for quotes and enterprise plans.",
  "license": "commercial / proprietary",
  "deployment_model": [
    "SaaS (cloud)",
    "cloud-hosted by customer on AWS",
    "cloud-hosted by customer on Azure",
    "cloud-hosted by customer on GCP",
    "hybrid (customer-managed cloud + Databricks control plane)"
  ],
  "os_platforms": [
    "Web (browser UI)",
    "Linux (compute clusters)",
    "Windows (clients/tools)",
    "macOS (clients/tools)"
  ],
  "features": [
    "Unified Lakehouse (Delta Lake)",
    "Data engineering (ETL/streaming)",
    "Data warehousing / Databricks SQL",
    "Data science workspaces / notebooks",
    "Machine learning / MLOps",
    "Generative AI and LLM support",
    "Data governance (Unity Catalog)",
    "Delta Sharing (secure data sharing)",
    "Real-time streaming, event pipelines",
    "Marketplace & solution accelerators",
    "Collaborative notebooks and IDE integrations",
    "Job scheduling and orchestration"
  ],
  "optional_modules": [
    "Unity Catalog (governance)",
    "Delta Sharing (data sharing)",
    "Databricks SQL (serverless warehouse)",
    "Databricks Apps (application development)",
    "MarketPlace / Partner Connect",
    "Lakehouse-specific solution accelerators"
  ],
  "integrations": [
    "Redox (healthcare interoperability partner)",
    "IQVIA (life sciences data partner)",
    "AWS (cloud provider)",
    "Azure (cloud provider)",
    "Google Cloud (cloud provider)",
    "SAP (partner/cloud integration)",
    "Partner Connect ecosystem (technology partners)",
    "Third-party IDEs and BI tools via connectors"
  ],
  "data_standards": [
    "FHIR",
    "HL7 v2",
    "HL7 FHIR streaming/pipelines (via accelerators)",
    "CSV/JSON/Parquet/Delta formats"
  ],
  "api_available": "yes",
  "system_requirements": "",
  "compliance": [
    "HIPAA (supports HIPAA workloads / BAA)",
    "GDPR",
    "SOC 2",
    "ISO 27001"
  ],
  "certifications": [
    "SOC 2 Type II",
    "ISO 27001"
  ],
  "security_features": [
    "Encryption at rest and in transit",
    "Role-based access control (RBAC)",
    "Single sign-on / SAML / OIDC",
    "Audit logging / event logs",
    "Customer-managed keys / KMS integration"
  ],
  "privacy_features": [
    "Business Associate Agreement (BAA) available",
    "Data anonymization / de-identification tooling (via partners/accelerators)",
    "Access controls and consent-supporting controls"
  ],
  "data_residency": "Region selection by cloud provider (customer chooses AWS/Azure/GCP region / BYO cloud region)",
  "customers": [
    "CVS Health",
    "Walgreens",
    "SCAN Health Plan",
    "Accolade",
    "Austin Health",
    "Flo Health",
    "Verana Health",
    "Milliman",
    "Ensemble Health Partners",
    "OMRON Healthcare",
    "Orizon",
    "Ribbon Health"
  ],
  "user_reviews": [
    "Great for getting Apache Spark up and running quickly, but DBU costs add up and you need to tune for large jobs.",
    "Excellent platform for unifying data engineering, analytics, and ML — MLflow integration is very useful.",
    "Cluster spin-up times can be excessive and cost control is challenging.",
    "Powerful for big data and machine learning, but has steep learning curve and some proprietary features.",
    "Makes collaborative data science easier with notebooks and model registry, though debugging and performance tuning can be painful.",
    "Good integration with cloud providers, but reliance on a single vendor can be a risk if outages occur.",
    "Strong for productionizing ML, but support/response times can vary during peak periods.",
    "Provides solid scalability and features for enterprises, though smaller teams may find it expensive.",
    "Improved productivity for data teams, yet permissions and governance setup can be complex.",
    "Notebooks and collaborative workflows are a big plus; cost and cluster management remain common complaints."
  ],
  "ratings": [
    "G2: 4.6/5 (product reviews)",
    "Capterra: 4.8/5 (user reviews)",
    "Gartner Peer Insights: 4.5/5 (user reviews)"
  ],
  "support_channels": [
    "email",
    "phone",
    "chat",
    "ticketing",
    "community forum",
    "knowledge base",
    "account management"
  ],
  "training_options": [
    "documentation",
    "webinars",
    "live_online",
    "onsite training",
    "certification",
    "academy courses"
  ],
  "release_year": "2013",
  "integration_partners": [
    "Microsoft Azure",
    "AWS",
    "Google Cloud Platform",
    "Datavant",
    "Ribbon Health",
    "Avanade",
    "Synapxe",
    "Tableau",
    "Power BI",
    "Fivetran",
    "Confluent",
    "Looker",
    "Snowflake",
    "Salesforce"
  ],
  "id": "P0447",
  "slug": "databricks",
  "type": "product",
  "version": "1.0",
  "last_updated": "2025-09-07",
  "links_json": {
    "self": "https://www.healthaidb.com/software/databricks.json"
  }
}