You may not realize this, but Apache Spark is a framework. Spark is a cluster-computing engine that manages parallel execution extremely well. Spark supports other technologies, including Java, Scala, Python, R, and graph processing. Spark stitches together previously disparate functionality into a cohesive, syntactically similar set of commands. Spark’s architecture is library-driven and includes the following libraries: Spark SQL …
Continue reading On Data Frameworks…
Category: SSIS Catalog Compare
Data Integration Life Cycle Management with SSIS is Available!
My latest book – Data Integration Life Cycle Management with SSIS – is now available! In this focused (185-page) book, I walk through ways to manage SSIS in the enterprise using what ships in the box. I point out what works well and what’s missing. I share workarounds for the stuff that’s missing, and then I …
Data Integration Life Cycle Management with SSIS: A Short Introduction by Example is Available for Pre-Order!
I’m excited to announce that book #3 for 2017 is available for pre-order at Amazon! Data Integration Life Cycle Management with SSIS: A Short Introduction by Example is a(nother) short-ish book (~112 pages) that offers insight into managing SQL Server Integration Services (SSIS) in an enterprise. I wrote the book to demonstrate managing SSIS as it …
Expert SSIS Training 4-5 Dec 2017!
Yep, we’re doing it again. In cooperation with Brent Ozar Unlimited, the next delivery of Expert SSIS Training is 4-5 Dec 2017. I had a great time delivering this course with Brent in September and I’m excited to have the opportunity to share this material with you again! If you register soon (written 27 Sep …
Coming Soon: SSIS Catalog Compare Updates
I spent a lot of time this year writing and managing a book project. I will blog about that later, I promise. While I enjoy writing, it really cuts into my play time. What do I do when I play? I write software. This weekend I cracked open the code for SSIS Catalog Compare and …
The Recording for Designing an SSIS Framework is Now Available
Kent Bradshaw and I had a great crowd, fielded lots of awesome questions, and delivered good demonstrations at the Designing an SSIS Framework webinar. You can view the webinar for free (although registration is required) here. In the webinar Kent and I discussed the following Data Integration Lifecycle Management Suite products: SSIS Framework Community Edition – free and open source; Catalog Reports – …
What is the DILM Suite? 2. SSIS Catalog Compare
SSIS Catalog Compare is really two products: SSIS Catalog Compare (GUI – graphical user interface) and CatCompare (CLI – command-line interface). This post will focus on GUI functionality as it’s easier to demonstrate. SSIS Catalog Compare provides the following functionality to enterprise Data Integration Lifecycle Management (DILM): Compares the contents of two SSIS Catalog instances. …
Deploying SSIS Projects to a Restored SSIS Catalog (SSISDB)
This post was originally shared on SQLBlog. You’re coding away with SSIS, happy as a clam, and it’s now time to deploy your hard work to an instance of the SSIS Catalog. If you’re in Visual Studio, you right-click the project name in Solution Explorer and click Deploy. If you’ve been given an ISPAC file, …
Data Wrangling Concepts: Latency and Staleness
This post was originally published at SQLBlog. Latency is the amount of time between data creation and load. Staleness is the amount of time since data in a target was refreshed. In many data integration use cases, a latency or staleness of one day, one week, or even one month is acceptable. Enterprise Data & Analytics helps enterprise data integration teams …
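To make the two definitions above concrete, here is a minimal Python sketch. The function names (`latency`, `staleness`) and the sample timestamps are hypothetical, chosen only for illustration; they do not come from the original post or from any SSIS tooling.

```python
from datetime import datetime

def latency(created_at: datetime, loaded_at: datetime):
    """Time between data creation and load (hypothetical helper)."""
    return loaded_at - created_at

def staleness(last_refreshed_at: datetime, now: datetime):
    """Time since the target was last refreshed (hypothetical helper)."""
    return now - last_refreshed_at

# Illustrative timestamps only: a row created in the source system,
# loaded by a nightly run, and inspected the following afternoon.
created = datetime(2017, 10, 1, 8, 0)
loaded = datetime(2017, 10, 2, 2, 0)
checked = datetime(2017, 10, 2, 14, 0)

row_latency = latency(created, loaded)          # creation -> load
target_staleness = staleness(loaded, checked)   # last refresh -> now

print(f"Latency: {row_latency}")
print(f"Staleness: {target_staleness}")
```

Latency is fixed once a row is loaded, while staleness keeps growing until the next refresh, which is why the sketch passes the current time in explicitly.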
An Example of Data Integration Lifecycle Management with SSIS, Part 4
In this post I will demonstrate advanced options for executing SSIS packages in the SSIS Catalog. I’ll also demonstrate SSIS Catalog Browser, SSIS Framework Community Edition, Framework Browser, and Catalog Reports – free utilities and solutions from DILM Suite that may help as you implement your enterprise Data Integration Lifecycle Management (DILM). The Problem I …