IHS Energy Group to merge PI and Dwights well files (May 1999)

Two US well and production files from Petroleum Information and Dwight's Energydata, are being merged into new databases; PIDplus (Production) and PIDplus (Wells).

Well and production data will be linked through new, standardized names and codes to provide what IHS describe as "a single source for the most complete, accurate, and easily accessed U.S. well and production information available anywhere".

50 man-years

Since the 1997 merger, PI and Dwights have spent fifty man-years integrating the two production and well files. The integration is an ongoing process and does not appear to offer a totally unified product. In fact the deliverables initially will be a production data base and a separate well database.

data first

The focus initially is on data clean up rather than data modeling. The integration process included detailed regional comparisons, audits and matching procedures.


Producing properties will be matched to the original well completions via the API number in an ongoing audit and data correlation project. New export formats will also be available.

The new production database will include proprietary files from old state records which are no longer available at state agencies. IHS Energy claim that such data will be an advantage to those who depend upon "accurate, complete production volumes in their business".


Importantly, mechanisms will be in place to allow direct feedback from the industry. The new well database merges 2.8 million wells from PI and 1.3 million from Dwights in a procedure described as "lengthy and complex, utilizing intensive data processing and manual intervention".

code tables

Linking the production and well datasets will be achieved through the Standard Codes Project which will synchronize the codes and nomenclature used in the different data sources. These codes will be implemented in our new integrated PIDplus databases. These new codes will follow the design of PI’s PIDM relational data model, itself based on the Public Petroleum Data Model.

PDM comment. What interests us about this project is the ratio between the effort in data cleanup and data modeling. While more intellectual satisfaction may be had from discussing entities and relations, the real world task of matching million-well datasets involves hand-crafted matching of individual data items.


In short, lots of real grunt work. The new code tables appear almost simultaneous with PPDM and POSC’s collaboration in this field. Some quick work may be required here to align these different initiatives.

Click here to comment on this article

If your browser does not work with the MailTo button, send mail to pdm@oilit.com with PDM_V_3.3_9905_2 as the subject.

© Oil IT Journal - all rights reserved.