
External reviews

External reviews are not included in the AWS star rating for the product.
It was fantastic for all the integrations it had. Custom ml transform support could have been better
What do you like best about the product?
The fact that you can store all your models at one place
What do you dislike about the product?
The support for custom transforms isn't there
What problems is the product solving and how is that benefiting you?
We wanted a place to store all models at one place with versioning. Many benefits like serving the model are offered out of the box through databricks
Recommendations to others considering the product:
If you want to store all your ml models in one place then it's the way to go.
- Leave a Comment |
- Mark review as helpful
Perfect code sharing repository
What do you like best about the product?
Having a platform to share codebase with team members and run machine learning models on the cloud.
What do you dislike about the product?
Sometimes we have to restart clusters to fix memory errors, which leads to data loss.
What problems is the product solving and how is that benefiting you?
Collaboration on code among team members. Running applications on the cloud.
Brilliant on developing the best collaborative platform for data scientists and data engineers
What do you like best about the product?
An interface that is better than Jupyter notebooks that allows SQL, Scala, PySpark, Python, R and the ability to collabortate on notebooks
What do you dislike about the product?
DPU based billing is fixed and minimum is 3 node cluster. For a small entity the advantages of using AWS Glue interface to Spark 2.x outweighs the benefits of a permenant cluster runnig with Databricks.
What problems is the product solving and how is that benefiting you?
Big data management in lake type architecture using Parquet formats, PySpark developments and enhancements.
Databricks is the best option for your data workloads and pipelines
What do you like best about the product?
It is a highly adaptable solution for data engineering, data science, and AI
What do you dislike about the product?
I wouldn't say I like the lack of an easier way to import personalized code files or libraries from notebooks.
What problems is the product solving and how is that benefiting you?
I've solved emergency telephone data processing and insights. The performance of the solution is desirable.
Senior Cloud Evangelist and Architect
What do you like best about the product?
Spark Distribution of query and speed of batch query so does performance
What do you dislike about the product?
Interface can be make better and more intutive
What problems is the product solving and how is that benefiting you?
Big Batch bulk Parallel programming
Great platform for our Big Data needs
What do you like best about the product?
Easy administration, easy to create jobs from notebooks, great development environment, new and exciting features coming.
What do you dislike about the product?
Taking away our dedicated customer service rep and replacing this with just a support GUI.
What problems is the product solving and how is that benefiting you?
All our data pipelines are on Databricks. Benefitted from improved performance on Spark.
Staging data for insights
What do you like best about the product?
It makes the power of Spark accessible and innovative solutions like Delta Lake.
What do you dislike about the product?
Fewer solutions that aren't wholly or partially on the cloud.
What problems is the product solving and how is that benefiting you?
We are staging large datasets for reporting and multiple BI solutions.
Best tool for big data
What do you like best about the product?
Easy to use multiple languages based command in same notebook. Direct connection to Redshift.
What do you dislike about the product?
Sometime it takes lot of time to load data. Should show better suggestions.
What problems is the product solving and how is that benefiting you?
We are using databricks to analyse big data and get business insights.
One stop shop for all your data problems
What do you like best about the product?
It has got everything in it. IDE, Version Control, Scheduling whatnot.
What do you dislike about the product?
I didn't find something that discomforts me yet.
What problems is the product solving and how is that benefiting you?
Currently, I'm using it as an ETL tool. It's easy to use and connects with any data source—excellent documentation and help from the community.
Recommendations to others considering the product:
Just go for it. You can do many things you want to do with your data.
Very powerful yet easy to use distributed computing and data warehousing platform
What do you like best about the product?
Databricks had very powerful distributed computing built in with easy to deploy optimized clusters for spark computations. The notebooks with MLFlow integration makes it easy to use for Analytics and Data Science team yet the underlying APIs and CICD integrations make it very customizable for the Data Engineers to create complex automated data pipelines. Ability to store and query and manipulate massive Spark SQL tables with ACID in Delta Lake makes big data easily accessible to all in the organization.
What do you dislike about the product?
It lacks built in data backup features and ability to restrict data access to specific users. So if anyone accidentally deletes data from Delta Table or DBFS, the lost data cannot be retrieved unless we setup our own customized backup solution.
What problems is the product solving and how is that benefiting you?
I have worked with big data with hundreds of millions of rows using databricks. We do most of the ELT, data cleaning and prepping works on databricks. The ease and speed of querying bid data using databricks SparkSQL is very useful. It is also very easy to create prototype codes utilizing real sized data using the available Python and R notebooks.
showing 431 - 440