Databricks on AWS

Updated Mar 29, 2023


SparkR ML tutorials

  • Use glm
    • Load diamonds data and split into training and test sets
    • Train a linear regression model using glm()
    • Train a logistic regression model using glm()


© Databricks 2023. All rights reserved. Apache, Apache Spark, Spark, and the Spark logo are trademarks of the Apache Software Foundation.