Big Data Use Column Encoding Adding compression to large, uncompressed columns will have a big impact on cluster performance. Compression accomplishes two things: Reduce storage utilization. Because file compression reduces the size footprint of... Donal Tobin
Big Data Use Amazon Redshift Spectrum for infrequently used data Amazon Redshift launched with disruptive pricing. To compare the cost, we’re looking at the price for storing 1TB of data for one year ($ / TB / Year). With a... Abe Dearmer
Big Data Query Optimization: How to efficiently compare two rows in a SQL query Query optimization that dramatically reduces runtime for queries that use window functions. Donal Tobin
Big Data How Wish Built Their Data Pipeline with Amazon Redshift Wish is a mobile commerce platform. It provides online services that include media sharing and communication tools, personalized and other content, as well as e-commerce. During the last few... Abe Dearmer
Cloud Integration Using Opsworks and HAProxy for Routing At Integrate.io, with much of our infrastructure on AWS, we try to make use of the various AWS services available to us. One of these is Amazon Opsworks. Mark Smallcombe
Big Data Benchmarking the Performance of Amazon Redshift ra3.16xlarge versus ds2.8xlarge instances A first look at the new RA3 Amazon Redshift node type. Abe Dearmer
Cloud Integration Understanding NoSQL Databases Is a NoSQL database better than one that follows the relational model, or vice versa? Donal Tobin
Cloud Integration AWS Announces DS2 Amazon Redshift with 50% Better Performance at the Same Price AWS announced yesterday that they are rolling out a newer generation DS2 Amazon Redshift that has 50% better performance compared to what they previously offered, at the exact same price. Mark Smallcombe
Cloud Integration Moore’s Law and Kryder’s Law Everything from cloud-based data warehousing solutions like Amazon Redshift to data storage techniques like database replication has evolved due to these massive strides in technology. Donal Tobin
ETL 46 min Top 25 ETL Tools (Updated Dec 2025) Integrate.io lists the 14 best ETL software tools for 2025 based on features, user review scores, and more. Which ETL tool should you choose? Abe Dearmer
ETL 15 min SFTP to Salesforce – Guide to a Secure Integration Integrating data from your systems into Salesforce is a critical process. Here is a guide featuring the SFTP to Salesforce process. Donal Tobin
Data Integration 10 min What Is Operational ETL? Delving into Operational ETL: Learn how it transforms business data handling for enhanced productivity. Donal Tobin