Sometimes information systems professionals can draw very useful analogies from the hard sciences. For example, physics offers us one very useful concept: the states of matter. Figure 1 shows an excellent diagram, copied from Wikipedia, explaining the four states of matter.

States of Data

Looking at these four states, I’m going to suggest that there are really just two fundamental states of matter: bound and liberated. Bound connotes that molecular cohesion binds the matter into a semi-solid to solid state, while liberated signifies that the matter overcomes such molecular cohesion to more freely “bounce around” and thus be “anti-solid”.

It’s my contention that, much like matter, data has two basic states regardless of whether it lives in a database or some other store: inert (i.e. at rest) and mobile (i.e. in motion). This simple concept provides a logical point of reference when talking about data, and in particular about the tools used to work with it.

Within the physics of matter framework, if I tell you that water is in its liquid state, then you instantly think of tools that might be useful for working with it in that state, such as a rag, sponge, bucket, mop or wet vac. If I then say we need a tool to convert the water from its liquid state to a gaseous state, you again instantly think of tools for that job, such as a candle, Bunsen burner, hot plate, microwave or boiler. Thus knowing the state helps us decide what tool to use for working with that state.

The same kind of thought process can work for the states of data. If I say that the data is at rest, then tools such as an ER diagrammer or data modeler might be useful to examine the design or structure of that data. Likewise, if my goal is to put that data in motion, then a script, program, utility or software tool that reads the data from its current source and writes it to a new target springs to mind. This process is generally referred to as “data migration”. Cirro offers an extremely capable yet simple data migration tool called Data Puppy. It offers many other useful features, but its core function is to simply and quickly move lots of data from point A to point B, no matter what A and B are in terms of databases.
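As a rough illustration of putting data in motion, the sketch below copies every row of a table from a source database to a target database, unchanged, using Python's built-in sqlite3 module. The `customers` table, its columns and the `migrate_table` helper are hypothetical names chosen for the example, and this is not a description of how Data Puppy works internally; it only shows the basic read-from-source, write-to-target pattern.

```python
import sqlite3


def migrate_table(source, target, table, batch_size=1000):
    """Copy all rows of `table` from source to target as-is; return the row count.

    Assumes the table already exists in the target with the same columns.
    """
    cursor = source.execute(f"SELECT * FROM {table}")
    columns = [col[0] for col in cursor.description]
    placeholders = ", ".join("?" for _ in columns)
    insert_sql = (
        f"INSERT INTO {table} ({', '.join(columns)}) VALUES ({placeholders})"
    )

    copied = 0
    while True:
        # Read in batches so a large table never has to fit in memory at once.
        rows = cursor.fetchmany(batch_size)
        if not rows:
            break
        target.executemany(insert_sql, rows)
        copied += len(rows)
    target.commit()
    return copied


# Two in-memory databases stand in for "point A" and "point B".
source = sqlite3.connect(":memory:")
target = sqlite3.connect(":memory:")
ddl = "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)"
source.execute(ddl)
target.execute(ddl)
source.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "Ann"), (2, "Bob"), (3, "Cy")],
)
source.commit()

rows_copied = migrate_table(source, target, "customers")
```

Note that no per-row logic runs here: rows are read and written in bulk exactly as they are, which is what keeps a pure migration cheap.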

Now we can further leverage the physical sciences example to distinguish between a data migration solution such as Data Puppy and an ETL (Extract, Transform and Load) tool. Returning to the physics of matter framework, if I now tell you that we want to deconstruct water into its base elements (hydrogen and oxygen), or construct water from those same elements, then we need radically different tools. We instinctively know that we cannot use a simple tool like a beaker or Bunsen burner for this task. We need more complex tools that cause more costly chemical reactions, such as an electrolysis tank to break water apart. That’s akin to an ETL tool for data, because those tools run expensive logic on every row of data to cleanse, separate or join it. Not only are they expensive in terms of execution, but they also generally carry a high cost in training and in the time needed to define tasks.
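To make the per-row cost of ETL concrete, here is a minimal sketch using Python's built-in sqlite3 module. Every name in it (`customers_raw`, `customers`, `clean_row`, `etl_copy`) is hypothetical, and the cleansing rule is invented purely for illustration; real ETL tools apply far richer logic. The point is only that each row passes through transformation code before it is loaded.

```python
import sqlite3


def clean_row(row):
    """Hypothetical cleansing rule: trim and title-case the name, then split it."""
    cust_id, raw_name = row
    parts = raw_name.strip().title().split(None, 1)
    first = parts[0] if parts else ""
    last = parts[1] if len(parts) > 1 else ""
    return cust_id, first, last


def etl_copy(source, target):
    """Extract raw rows, transform each one, and load the result."""
    rows = source.execute("SELECT id, name FROM customers_raw")
    # The transform step runs once for every single row that moves.
    cleaned = (clean_row(row) for row in rows)
    target.executemany(
        "INSERT INTO customers (id, first_name, last_name) VALUES (?, ?, ?)",
        cleaned,
    )
    target.commit()


source = sqlite3.connect(":memory:")
target = sqlite3.connect(":memory:")
source.execute("CREATE TABLE customers_raw (id INTEGER PRIMARY KEY, name TEXT)")
target.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT)"
)
source.executemany(
    "INSERT INTO customers_raw VALUES (?, ?)",
    [(1, "  ann LEE "), (2, "bob")],
)
source.commit()

etl_copy(source, target)
loaded = target.execute(
    "SELECT first_name, last_name FROM customers ORDER BY id"
).fetchall()
```

Besides the extra work at run time, someone also has to design, write and maintain rules like `clean_row`, which is where the training and task-definition cost comes from.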

So when you simply need to move data, think Data Puppy.


About bscalzo2

Bert Scalzo is an Oracle ACE, blogger, author, speaker and database technology consultant. His work experience includes stints as product manager for DBArtisan and Rapid SQL at IDERA and chief architect for the popular Toad family of products at Quest Software. He has three decades of Oracle® database experience and previously worked for both Oracle Education and Oracle Consulting. Bert holds several Oracle Masters certifications and his academic credentials include a BS, MS and Ph.D. in computer science, as well as an MBA. He has presented at numerous Oracle conferences and user groups, including OOW, ODTUG, IOUG, OAUG, RMOUG and many others. Bert’s areas of interest include data modeling, database benchmarking, database tuning and optimization, "star schema" data warehouses, Linux® and VMware®. He has written for Oracle Technology Network (OTN), Oracle Magazine, Oracle Informant, PC Week (eWeek), Dell Power Solutions Magazine, The LINUX Journal, Oracle FAQ and Toad World. Bert has also written the following books: • Oracle DBA Guide to Data Warehousing and Star Schemas • TOAD Handbook (1st Edition) • TOAD Handbook (2nd Edition) • TOAD Pocket Reference (2nd Edition) • Database Benchmarking: Practical Methods for Oracle & SQL Server • Advanced Oracle Utilities: The Definitive Reference • Oracle on VMware: Expert Tips for Database Virtualization • Introduction to Oracle: Basic Skills for Any Oracle User • Introduction to SQL Server: Basic Skills for Any SQL Server User • Toad Unleashed • Leveraging Oracle Database 12cR2 Testing Tools • Database Benchmarking and Stress Testing (coming 2018)