DataStax, Inc. is a real-time data for AI company based in Santa Clara, California.[3] Its product Astra DB is a cloud database-as-a-service based on Apache Cassandra. DataStax also offers DataStax Enterprise (DSE), an on-premises database built on Apache Cassandra, and Astra Streaming, a messaging and event streaming cloud service based on Apache Pulsar. As of June 2022, the company has roughly 800 customers distributed in over 50 countries.[4][5][2]

DataStax
Company typePrivate
IndustryDatabase Technologies
GenreMulti-Model DBMS
FoundedApril 2010
Austin, TX, USA
Founder
  • Jonathan Ellis
  • Matt Pfeil
Headquarters,
United States
Key people
Chet Kapoor[1] (CEO)
Davor Bonaci (CTO)
Ed Anuff (CPO)
Don Dixon (CFO)
Brad Gyger (CRO)
Jason McClelland (CMO)
Chris Vogel (Chief People Officer)
Number of employees
800+ (June 2022)[2]
WebsiteDataStax.com

History edit

DataStax was built on the open source NoSQL database Apache Cassandra. Cassandra was initially developed internally at Facebook to handle large data sets across multiple servers,[6] and was released as an Apache open source project in 2008.[7] In 2010, Jonathan Ellis and Matt Pfeil left Rackspace, where they had worked with Cassandra, to launch Riptano in Austin, Texas.[6][8] Ellis and Pfeil later renamed the company DataStax, and moved its headquarters to Santa Clara, California.[3][9]

The company went on to create its own enterprise version of Cassandra, a NoSQL database called DataStax Enterprise (DSE).[6]

In 2019, Chet Kapoor was named the company's new CEO, taking over from Billy Bosworth.[10]

 
Original logo

In May 2020, DataStax released Astra DB, a DBaaS for Cassandra applications.[11] In November 2020, DataStax released K8ssandra, an open source distribution of Cassandra on Kubernetes.[12] In December 2020, DataStax released Stargate, an open source data API gateway.[13]

After acquiring streaming event vendor Kesque in January 2021,[14] the company launched Luna Streaming, a data streaming platform for Apache Pulsar.[15] DataStax then rebuilt the Kesque technology into Astra Streaming.[16] The Astra Streaming cloud service became generally available on June 29, 2022.[17] With the release, the company added API-level support for messaging tools Apache Kafka, RabbitMQ and Java Message Service, in addition to Apache Pulsar.[18][19] Astra Streaming can connect to a larger data platform by utilizing DataStax’s Astra DB cloud service.[18]

Starting in 2023, DataStax began incorporating artificial intelligence and machine learning into its platform.[20] In January 2023, the company acquired Kaskada, developer of a platform that helps organizations use data for AI applications.[21] DataStax made the formerly proprietary Kaskada technology open source, and integrated it into its Luna ML service, which was launched on May 4, 2023.[22] With the acquisition, former Kaskada CEO Davor Bonaci was named DataStax chief technology officer and executive vice president.[22]

On May 24, 2023, DataStax announced that it would be partnering with ThirdAI to bring large language models to DSE and AstraDB, to help developers develop generative AI applications.[23]

In June 2023, the company announced the development of a GPT-based schema translator in its Astra Streaming cloud service. The Astra Streaming GPT Schema Translator uses generative AI to automatically generate schema mappings, to enable data integration and interoperability between multiple systems and data sources.[24]

On July 18, 2023, the company announced a partnership with Google to make semantic search available in its Astra DB cloud database for developers building generative AI applications.[20]

On September 13, 2023, DataStax launched the LangStream open source project, which works with Astra DB and supports vector databases including Milvus and Pinecone. LangStream enables developers to better work with streaming data sources, using Apache Kafka technology and generative AI to help build event-driven architectures.[25]

In November 2023, DataStax announced RAGStack, a simplified commercial offering for RAG (retrieval-augmented generation) based on LangChain and Astra DB vector search.[26]

Products edit

Astra DB edit

Astra DB is available on cloud services such as Microsoft Azure, Amazon Web Services, and Google Cloud Platform.[27] In February 2021, DataStax announced the serverless version of Astra DB, offering developers pay-as-you-go data.[28]

In March 2022, DataStax introduced new change data capture (CDC) capabilities to its Astra DB cloud service. Astra DB CDC is powered by Apache Pulsar, which allows developers to manage operational and streaming data in one place.[29] DataStax leads the open-source Starlight, which provides a compatibility layer for different protocols on top of Apache Pulsar.[18]

On February 8, 2023, DataStax launched Astra Block, a cloud-based service based on the Ethereum blockchain to support building Web3 applications, available as part of Astra DB. Astra Block can be used by developers to stream enhanced data from the Ethereum blockchain to build or scale Web3 experiences on Astra DB.[30]

Astra DB supports open source LangChain technology, making it easier for developers to create generative AI applications.[20]

DSE edit

Version 1.0 of the DataStax Enterprise (DSE), released in October 2011, was the first commercial distribution of the Cassandra database, designed to provide real-time application performance and heavy analytics on the same physical infrastructure.[31][32] It grew to include advanced security controls, graph database models, operational analytics and advanced search capabilities.[33]

In April 2016, the company announced the release of DataStax Enterprise Graph, adding graph data model functionality to DSE.[34]

In March 2017, DataStax announced the release of its DSE platform 5.1, which included improved search capabilities, improved security control, improvements to its Graph data management and improvements to operational analytics performance. DataStax also announced a shift in strategy, with an added focus on customer experience applications. Rather than a new set of technologies, the company started to offer advice on best practice to users of its core DSE platform.[35][33]

In April 2018, DataStax released DSE 6, with the new version focused on businesses using a hybrid cloud computing model, with all the benefits of a distributed cloud database on any public cloud or on-premise, twice the responsiveness and ability to handle twice the throughput.[36][37]

In December 2018, DataStax released DSE 6.7, which offers enterprise customers five key new feature upgrades, including: improved analytics, geospatial search, improved data protection in the cloud, enhanced performance insights and new developer integration tools with Apache Kafka Connector and certified production Docker images.[38]

In April 2020, DataStax released DSE 6.8, offering enterprises new capabilities for bare-metal performance and to support more workloads, and serving as a Kubernetes operator for Cassandra.[39]

DSE 7.0 was introduced in August 2023. It offers enhancements in cloud-native operations and generative AI capabilities, and includes vector search.[40]

Funding and IPO edit

In September 2014, DataStax raised $106 million in a Series E funding round, raising the total investment in the company to $190 million.[3] On June 15, 2022, the company announced it had raised an additional $115 million, at a $1.6 billion valuation.[2][41]

In 2020, Mergermarket reported that DataStax was preparing for an initial public offering that could launch in 2021.[42] However, in June 2022, DataStax CEO Chet Kapoor said that the company would not rush into an IPO.[2]

See also edit

References edit

  1. ^ "Announcing Our New CEO".
  2. ^ a b c d "Cassandra vendor DataStax secures $115m investment for $1.6b valuation". theregister.com. Retrieved August 8, 2022.
  3. ^ a b c Gage, Deborah (4 September 2014). "DataStax Raises $106 Million in New Pre-IPO Round, Chips Away at Oracle". Wall Street Journal.
  4. ^ Banks, Martin (6 October 2017). "DataStax adds Oracle to provide practical collaboration". Diginomica.com.
  5. ^ Clancy, Heather (14 April 2015). "DataStax just scored a big partnership with HP. Here's why". Fortune.
  6. ^ a b c "OUT IN THE OPEN: THE ABANDONED FACEBOOK TECH THAT NOW HELPS POWER APPLE". Wired. 4 August 2014. Retrieved 18 September 2017.
  7. ^ Jackson, Joab (18 October 2011). "Apache Cassandra Ready for the Enterprise". CIO. Archived from the original on 6 September 2018. Retrieved 5 September 2018.
  8. ^ Clark, Don (26 October 2010). "Start-Up Riptano Predicts Success With Cassandra Database". Wall Street Journal.
  9. ^ Harris, Derrick (4 September 2014). "NoSQL is growing up, and DataStax just raised $106M to prove it". gigaom.com.
  10. ^ "Former Google VP Chet Kapoor joins DataStax as CEO". siliconangle.com. 22 October 2019. Retrieved February 22, 2021.
  11. ^ "Cassandra Now Officially In the Cloud with Datastax Astra". datanami.com. 12 May 2020. Retrieved February 26, 2021.
  12. ^ "DataStax unveils K8ssandra as cloud-native Cassandra". zdnet.com. Retrieved February 26, 2021.
  13. ^ "Meet Stargate, DataStax's GraphQL for databases". zdnet.com. Retrieved February 26, 2021.
  14. ^ "DataStax enters event streaming market with Apache Pulsar". techtarget.com. Retrieved August 8, 2022.
  15. ^ "DataStax acquires Kesque". techcrunch.com. Retrieved February 26, 2021.
  16. ^ "DataStax cofounder on evolving Cassandra for modern workloads". venturebeat.com. Retrieved August 8, 2022.
  17. ^ "DataStax Astra gets support for Kafka, RabbitMQ and JMS in bid to capture the 'full data story'". diginomica.com. 2022-06-29. Retrieved 2023-03-30.
  18. ^ a b c "DataStax extends Astra Streaming event data platform". techtarget.com. Retrieved August 8, 2022.
  19. ^ "DataStax Astra gets support for Kafka, RabbitMQ and JMS in bid to capture the 'full data story'". diginomica.com. Retrieved August 8, 2022.
  20. ^ a b c "DataStax brings vector database search to multicloud with Astra DB". venturebeat.com. Retrieved December 1, 2023.
  21. ^ "AI feature engineering is focus as DataStax acquires Kaskada". venturebeat.com. Retrieved December 1, 2023.
  22. ^ a b "DataStax extends AI feature engineering with Luna ML". venturebeat.com. Retrieved December 1, 2023.
  23. ^ "DataStax taps ThirdAI to bring generative AI to its database offerings". infoworld.com. Retrieved December 1, 2023.
  24. ^ "DataStax Plumbs AI Into Smarter Data Pipelines". forbes.com. Retrieved December 1, 2023.
  25. ^ "DataStax takes aim at event-driven AI with open source LangStream project". venturebeat.com. Retrieved December 1, 2023.
  26. ^ "With RAGStack, DataStax enables generative AI models to gain additional context from third-party data". siliconangle.com. Retrieved December 1, 2023.
  27. ^ "DataStax offers serverless, NoSQL Astra DB across multiple regions, clouds". infoworld.com. Retrieved August 8, 2022.
  28. ^ "DataStax Astra serverless DBaaS optimizes deployments". techtarget.com. Retrieved August 8, 2022.
  29. ^ "DataStax CEO: Every use case doesn't need a new database". infoworld.com. Retrieved August 8, 2022.
  30. ^ "DataStax launches Astra Block to support Web3 applications". infoworld.com. Retrieved December 1, 2023.
  31. ^ Cohan, Peter (24 Nov 2017). "DataStax Partners With Oracle In $46B Database Market". Forbes.com. Archived from the original on 5 September 2018. Retrieved 5 September 2018.
  32. ^ Harris, Derrick (20 September 2011). "DataStax gets $11M, fuses NoSQL and Hadoop". gigaom.com.
  33. ^ a b Carey, Scott (4 October 2017). "How DataStax wants its NoSQL platform to drive the 'right now economy'". Computerworld UK. Archived from the original on 5 September 2018. Retrieved 5 September 2018.
  34. ^ Miller, Ron (12 April 2016). "DataStax adds graph databases to enterprise Cassandra product set". techcrunch.com.
  35. ^ "DataStax CEO launches new CX strategy – focus shifting from tech to business". diginomica. 15 March 2017. Retrieved 12 September 2017.
  36. ^ Sargent, Jenna (19 April 2018). "DataStax Enterprise 6 released with double the Apache Cassandra performance". San Diego Times.
  37. ^ Whiting, Rick (17 April 2018). "DataStax Pushes The Cloud Database Performance Boundary With New Release". crn.com.
  38. ^ "DataStax announces the release of DSE 6.7". datastax.com.
  39. ^ "DataStax". crn.com. 28 April 2020. Retrieved February 22, 2021.
  40. ^ "DataStax Announces Vector Search for DataStax Enterprise". datanami.com. Retrieved December 1, 2023.
  41. ^ "DataStax raises $115M to advance its data stack". techtarget.com. Retrieved August 8, 2022.
  42. ^ "Venture Capital-Backed Tech Firm Exits To Watch In 2021". forbes.com. Retrieved February 22, 2021.

External links edit