Innovative Routines International

28°08′32″N 80°34′56″W / 28.1422°N 80.5822°W / 28.1422; -80.5822

IRI, The CoSort Company
Company typePrivate
IndustryData processing, Sorting, Data integration, Test data, Data masking, Data conversion
Headquarters,
USA
ProductsCoSort, Fast Extract (FACT), NextForm, RowGen, FieldShield, CellShield, DarkShield, Voracity
Websitewww.iri.com

Innovative Routines International (IRI), Inc. is an American software company first known for bringing mainframe sort merge functionality into open systems.[1] IRI was the first vendor to develop a commercial replacement for the Unix sort command, and combine data transformation and reporting in Unix batch processing environments.[2] In 2007, IRI's coroutine sort ("CoSort") became the first product to collate and convert multi-gigabyte XML and LDIF files,[3] join and lookup across multiple files,[4][5] and apply role-based data privacy functions (including AES-256 encryption) for fields within sensitive files.[6]

IRI is headquartered in Melbourne, Florida, United States, and has resale and support offices in 25 countries,[7] including France, Japan, South Africa, and Brazil.[8] Primary computing platform partners include HP,[9] IBM,[10] Fujitsu,[11] Intel,[12] Novell,[13] Red Hat, Sun Microsystems, and Microsoft.[14] CoSort users include: AIM Healthcare,[15] EDS,[16] HSBC Insurance,[17] and Thomson Reuters.[18] The company was named a 'Most Promising Big Data Solution Provider' by CIOReview in 2015 as it launched "Voracity" to support Hadoop processing, NoSQL data sources, etc.[19]

Products edit

IRI software is designed to transform, convert, report, and protect large data volumes rapidly in distributed, heterogeneous computing environments.[20] These functions are built into the CoSort package or through spin-offs for data extraction, generation, security, and migration. Each tool uses the same graphical IDE built on Eclipse, and metadata format for defining and manipulating data.[21] IRI's open data definition file format is also supported by AnalytiX DS and Meta Integration Technology (MITI) so that third-party ETL, BI, and data modeling tool users can convert or re-use their existing metadata in IRI product environments.[22]

IRI CoSort edit

CoSort was released for CP/M in 1978, DOS in 1980, Unix in the mid-eighties, and Windows in the early nineties,[23] and received a readership award from DMReview magazine in 2000,[24] CoSort was initially designed as a file sorting utility, and added interfaces to replace or convert the sort program parameters used in IBM Infosphere DataStage, Informatica, Micro Focus COBOL, JCL, NATURAL, SAS, and SyncSort Unix.[25]

In 1992, CoSort added related data manipulation functions through a control language interface based on DEC VAX/VMS sort utility syntax,[26] which evolved through the years to handle file-based data integration and staging functions in data warehouse ETL operations:[27]

For data warehouse and data mart applications, CoSort performs source data extraction, data cleansing, sorting, reformatting, data type conversion, aggregation, and indexing, all in a single pass. Most operational data in commercial and public sector enterprises reside internally in sequential flat files, (relational) database tables, or are imported from data tapes and transmissions generated externally. These historical databases are optimized for ad hoc queries and transactions, rather than for extraction. CoSort accepts multiple input files (large-scale tables or flat-file data dumps), or records streaming through pipes, to perform conditional selection on records for downstream processes.

— Dennis Hill, Database Trends Magazine, July 1999[28]

CoSort Version 9 releases, begun in 2007, can simultaneously transform, convert, report, and/or protect data for ETL, business intelligence, change data capture, database load and query,[29] application development, and data migration activities. Version 10 was released in 2018, adding support for semi-structured, streaming, and cloud data sources.

IRI Voracity edit

IRI Voracity is a data management platform released in 2016 for data discovery, integration, migration, governance, and analytics. It consolidates key data curation activities in the IRI Workbench GUI (built on Eclipse (software)™), and transforms data in the CoSort engine or optionally in MapReduce, Spark, Spark Stream, Storm, or Tez. Voracity includes most standalone IRI tools, and adds data profiling, ETL, metadata management, master data management, data federation, and multiple job design and control capabilities.[30]

Other tools edit

IRI CoSort, IRI FACT, IRI NextForm, and IRI RowGen are products in the IRI Data Manager suite. IRI FieldShield, IRI CellShield, and IRI DarkShield are products in the IRI Data Protector suite.

IRI FACT edit

FACT (FAst ExtraCT) is a high-performance unload utility for Oracle, IBM Db2, Sybase ASE and IQ, SQL Server, MySQL, Altibase, and Tibero. It exports large tables in parallel to flat files for archive, ETL, reorg, reporting and other applications.[31] FACT and CoSort used together "provide for rapid unloading and transformation of data in Oracle databases in support of ETL processes."[32]

IRI NextForm edit

NextForm is a data migration spin-off from CoSort functionality designed to convert between structured file formats such as CSV, ISAM, LDIF, and XML,[33] plus data types such as ASCII, EBCDIC, Unicode, and Packed Decimal.[34] Newer NextForm editions can structure data in unstructured sources, convert COBOL Vision files, and facilitate database migration and replication.[35]

IRI RowGen edit

RowGen is designed to generate test data in production table, file, and report formats for prototype database population, compliance, outsourcing, and application prototyping projects.[36][37] RowGen's GUI parses data models to define table layouts and relationships so database test sets are structurally and referentially correct.[38] RowGen can also transform and format test data during its generation.[39]

IRI FieldShield edit

FieldShield is a CoSort spin-off designed to protect data privacy in structured and semi-structured data sources.[40] The software protects personally identifiable information and other private data at the field or record level within database tables, files and other sources subject to data spill.[41] Privacy functions include AES encryption, data masking, and pseudonymization. Job details can be audited from a log file in XML format.[42]

IRI CellShield edit

CellShield is a data discovery and masking product designed for protecting data at the cell level in Microsoft Excel spreadsheets. CellShield comes in Personal and Enterprise editions, with the latter capable of finding and remediating PII in multiple files and sheets in drives and folders accessible on a local area network.

IRI DarkShield edit

DarkShield is a data discovery and masking product designed for protecting data hidden in so-called dark data, or unstructured file, repositories. DarkShield shares the same data searching, classification, and masking functionality with FieldShield and the CellShield Enterprise Edition in IRI Workbench.

IRI Workbench edit

The Workbench is a graphical user interface (GUI) and integrated development environment (IDE) for all IRI software products, built on Eclipse™. The Workbench is a free, optional place to design, run, and manage data connections,[43] metadata, and jobs, and to use third-party plug-ins for business intelligence, data modeling, version control, etc.

References edit

  1. ^ Wilkinson, Stephanie, "Applications", UNIX Today! (November 14, 1988)
  2. ^ Miller, David B., DP Labs: "A Better Sort of Sort", HP Professional 7(2) (February 1993)
  3. ^ IPFrontline Article (June 11, 2007)
  4. ^ Complex Joins and Lookups Now Run Outside a DBMS, Information Management Online (August 21, 2007)
  5. ^ "CoSort v9", IBM Systems Magazine (November 2007)
  6. ^ Munshi, Renee, "CoSort Adds Data Protection at the Field Level", WindowsITPro (June 14, 2007)
  7. ^ "IRI web site". Retrieved July 21, 2009.
  8. ^ "3CON Brazil News". Archived from the original on 2011-07-06. Retrieved 2009-07-30.
  9. ^ "Infrastructure Partners".
  10. ^ "IBM System z Linux Products".
  11. ^ "Fujitsu Solutions Catalog". Archived from the original on 2011-07-11. Retrieved 2009-07-30.
  12. ^ "Intel Early Release Program".
  13. ^ "SUSE Linux YES Certification".
  14. ^ "Microsoft Mainframe Migration Partners". Microsoft.
  15. ^ Information Management Magazine (July 2008)
  16. ^ Information Management Magazine (July 2005)
  17. ^ Information Management Magazine (July/Aug 2009)
  18. ^ Information Management Magazine (Jan/Feb 2009)
  19. ^ "BigData-IRIVoracity-CIOReview2015.PDF".
  20. ^ Beall, Scott. K & Hodges, Robert L., Sort/Merge: Software Comparison Columns, Gartner Research DPRO-91136 (July 16, 2002)
  21. ^ Global Research Partners (July 23, 2007)
  22. ^ Data-Conversion.org (April 24, 2006)
  23. ^ IRI company history (retrieved August 1, 2009)
  24. ^ "DM Review 100", DM Review, 10(12), (December 2000)
  25. ^ "IRI Whitepaper, "Third-Party Sort Replacement and Conversion Tool Examples"".
  26. ^ Product Brief, Software Magazine (May 1992)
  27. ^ ITToolbox, Oracle Database Connections (June 25, 2009)
  28. ^ Hill, Dennis M., "CoSort: The Emerging ETL Engine", Database Trends (July, 1999)
  29. ^ Burleson, Donald K., "Hypercharging Oracle Data Loading" (February 1, 2004)
  30. ^ IRI Announces Total Data Management Platform, BusinessWire (December 1, 2015)
  31. ^ High Speed Oracle Data Extract and Reload. Burleson Consulting
  32. ^ Friedman, Ted, et al., "Magic Quadrant for Data Integration Tools", Gartner Report (September 22, 2008)
  33. ^ JAXenter Portal, IT Republik (June 11, 2007)
  34. ^ COBOL User Group, File Conversion Tools Archived 2009-03-01 at the Wayback Machine (retrieved August 3, 2009)
  35. ^ CoSort Journal (March, 2014)
  36. ^ "IBM PartnerWorld Global Solutions Directory".
  37. ^ OTN Partner News, Oracle Magazine (March, 2008)
  38. ^ SQL Server Magazine (November, 2008)
  39. ^ Information Management Magazine (July, 2007)
  40. ^ "datagovernancesoftware.com/cat5".
  41. ^ Jarvis, Darick, "5 Tips for Protecting Sensitive Data" Archived 2011-07-08 at the Wayback Machine, Data Storage Connection (September 4, 2007)
  42. ^ Koopman, James, "Are You Compliant within Flat File Processing?" (April 13, 2009)
  43. ^ 136 Data Sources and Targets (IRI Web Site, July, 2014)

External links edit