We took part in Knowledgent’s first-ever Tekathon – an immersive education session bringing together Knowledgent Informationists and Big Data vendors. It was a great opportunity to discuss real-world Big Data use cases with folks who are as committed to customer success as we are. After introductions, we got into the nuts and bolts of exactly how Novetta Entity Analytics finds the dots and connects them so enterprises can realize the promise of their Hadoop investment.
We also dug into details around features that differentiate Novetta Entity Analytics: enterprise performance and rapid scalability, built-in intelligence, relationship identification, and flexibility. Showing Novetta Entity Analytics in action is a powerful thing – so we ended our session with a demo showcasing how we address security threat assessment, advanced customer 360 analysis, and fraud detection issues within an enterprise.
Thanks to Knowledgent for the great discussion, ideas and interactions. The recap below was originally published here on their blog.
As Informationists from Knowledgent, we help our clients break down the boundaries between business needs and information silos so they can focus on outcomes. Increasingly, exploration and innovation are demanding more from information assets. Data gurus are being presented with messy data infrastructures that hide information of muddy quality. Value is locked in unstructured data assets, leading to misinterpretations of the content and context of the data. We need a way to find value in data that arrives in a variety of formats within our information ecosystem. Our clients want to spend less effort extracting, consolidating, and cleaning data to produce reports.
Industry demands that vendors come up with solutions that eliminate redundant and duplicate tools. Capabilities for data stewardship and governance, including data profiling, data quality management, workflow management, security, transparency in matching logic, data visualization and manipulation, dashboards, and reporting are no longer “good-to-have,” but rather are a basic need.
Novetta, a next-generation technology vendor, is making headway in this space. Novetta specializes in specific solutions to enable entity analytics, cyber analytics, open-source analytics, and multi-INT analytics.
Novetta’s Entity Analytics (NEA) solution allows ingestion of multi-format, multi-platform datasets from sources such as CRM systems, Salesforce, clickstream, operational and transactional systems, logs, public datasets, and social media. Multi-domain flexibility allows for building multi-dimensional entities. This helps to create visualizations of enriched relationships across person, organization, location, and product channels.
The Novetta Entity Analytics solution maps data from disparate sources, entities, and attributes for profiling, so an analytical user can easily understand the characteristics and statistics of their data. Data cleansing functions can be applied to improve data quality. It also provides visualization dashboards for iterative data analysis and distributions, and allows users to re-apply matching strategies by adjusting thresholds. It also has a unique capability: a deconfliction process that splits over-matches. Profiled data can be reviewed, measured, and further improved.
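To make the idea of re-applying a matching strategy at a different threshold more concrete, here is a minimal Python sketch. It is not Novetta's implementation: the function name `connected_groups`, the record ids, and the similarity scores are all hypothetical, and the splitting rule (simply re-linking at a stricter threshold) is an assumption chosen for illustration. It shows how a loose threshold can chain three records into one over-matched group, and how a stricter threshold splits that group again.

```python
def connected_groups(ids, scores, threshold):
    """Group ids by linking every pair whose score meets the threshold.

    Illustrative only: linked records are grouped transitively
    (connected components), which is how over-matches can arise.
    """
    neighbours = {i: set() for i in ids}
    for (a, b), score in scores.items():
        if score >= threshold and a in neighbours and b in neighbours:
            neighbours[a].add(b)
            neighbours[b].add(a)

    groups, seen = [], set()
    for start in ids:
        if start in seen:
            continue
        stack, group = [start], []
        while stack:  # depth-first walk over linked records
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            group.append(node)
            stack.extend(neighbours[node] - seen)
        groups.append(sorted(group))
    return groups


# Hypothetical pairwise similarity scores between three records.
scores = {("a", "b"): 0.9, ("b", "c"): 0.6, ("a", "c"): 0.2}
ids = ["a", "b", "c"]

loose = connected_groups(ids, scores, threshold=0.5)   # a-b and b-c both link
strict = connected_groups(ids, scores, threshold=0.8)  # only a-b links
print(loose)
print(strict)
```

In this toy data, records "a" and "c" barely resemble each other but end up together at the loose threshold because both link to "b". Re-running the same strategy at the stricter threshold separates "c" again, which is the kind of split a deconfliction step is meant to produce.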
Technologically, Novetta Entity Analytics manages all its datasets within a Hadoop cluster and supports both the Hortonworks (HDP) and Cloudera (CDH) platforms. Ingestion of data and publication of its linked datasets to enterprise systems and applications can be Hadoop-enabled or go through ODBC connectivity.
The Novetta Entity Analytics Multi-Dimensional Enterprise Index is a collection of tables in HCatalog that stores all entity resolution results. Ultimately, unified entity relationships are made available to enterprise business processes for advanced analytics.
Finally, the Novetta Entity Analytics solution performs entity resolution, a process of linking and grouping massive volumes of standardized data without the use of keys to create an “Entity Index” in HCatalog. Additional strategies can be added or refined as analytical users gain quick insight into their data assets. Linear scaling is supported, which helps avoid performance issues as datasets grow. Transparency in the application of matching strategies, rather than a typical “black box” approach, is a key feature. The outputs of entity resolution are visualizations of match decisions, which allow users to recognize connections, relationships, patterns, and trends.
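As a rough illustration of key-less linking with transparent match decisions, the Python sketch below groups records that share no common identifier by comparing their field values. Everything in it is an assumption made for the example: the `resolve` function, the token-overlap (Jaccard) similarity, the 0.5 threshold, and the sample records are hypothetical and do not describe how Novetta Entity Analytics works internally. It also compares every pair of records, which is fine for a toy dataset but does not scale linearly.

```python
from itertools import combinations


def tokens(record):
    """Lower-cased word tokens taken from all field values of a record."""
    return {word for value in record.values() for word in str(value).lower().split()}


def jaccard(a, b):
    """Share of tokens two records have in common (0.0 to 1.0)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def resolve(records, threshold=0.5):
    """Link records without a shared key and keep every match decision."""
    parent = list(range(len(records)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    token_sets = [tokens(r) for r in records]
    decisions = []  # (index_a, index_b, score, matched) for later review
    for i, j in combinations(range(len(records)), 2):
        score = jaccard(token_sets[i], token_sets[j])
        matched = score >= threshold
        decisions.append((i, j, round(score, 2), matched))
        if matched:
            parent[find(j)] = find(i)

    groups = {}
    for i in range(len(records)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values()), decisions


# Hypothetical records from different sources, with no common key.
records = [
    {"name": "Jane Doe", "city": "Boston"},
    {"name": "jane doe", "city": "boston", "source": "crm"},
    {"name": "John Smith", "city": "Denver"},
]

groups, decisions = resolve(records)
print(groups)
for decision in decisions:
    print(decision)
```

Keeping the full list of decisions, including the pairs that did not match, is what lets a reviewer see why two records were or were not linked instead of treating the result as a black box.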
The Novetta Entity Analytics solution provides its users with flexibility, a high speed of ingestion, the ability to apply data quality rules, and overall ease of use for analytical users. Data can be standardized and consolidated to a merged master record with associated hierarchies and complex relationships.
The near-term roadmap includes some great capabilities, such as an extended workflow to a Solr search index. Moving this index to a graph database is also something to expect from this vendor.
In conclusion, Novetta Entity Analytics is a great, innovative solution that meets industry demands.