Posts

CRISP DM first phase Data Understanding using Exploratory Data Analysis (EDA) (MatPlotLib and Seaborn)

Image
Analysis of Stock Prices of 4 Information Technology majors from 1st January 2009 to 1st January 2018. Four IT majors are: Oracle (ORCL) Microsoft (MSFT) SAP (SAP) Accenture (ACN) Used Pandas Data Reader for fetching data from Morning Star using Remote Data Access This exercise is just to practice data visualisations, no advice on Stock prices is provided here. This exercise is to see how all four IT majors progressed their way from 1st January 2009 to 1st January 2018. The very first step involved getting the data from Morning Star: # Oracle dataframe ORCL = data.DataReader("ORCL", 'morningstar', start, end) ORCL_New = ORCL.reset_index(level=0) # Microsoft dataframe MSFT = data.DataReader("MSFT", 'morningstar', start, end) MSFT_New = MSFT.reset_index(level=0) # SAP dataframe SAP = data.DataReader("SAP", 'morningstar', start, end) SAP_New = SAP.reset_index(level=0) # Accenture datafr...

CRISP DM

Image
CRISP DM stands for Cross Industry standard process for Data Mining. This is an open standard process followed by data miners. For details visit the below link: CRISP DM There are six phases involved in Data Mining: Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment
Information on Oracle Business Intelligence Cloud Services (BICS) After Oracle Open world the buzzword is Oracle BI Cloud Services (BICS). Stoppage of further Informatica version of Oracle BI Applications 7964 (OBIA) and focus on Oracle Data Integrator is talking point. Customers are worried about the direction taken by Oracle as they invested a lot and suddenly Oracle changed its focus. For Service providers it's a challenge as well because on behalf of Oracle we have to convince our Customers how smoothly we can move them to this new world without impacting their business. Oracle BI Cloud Service is bundled with 2 products: 1. Oracle BI 2. Oracle Database Oracle BI Cloud Service is one of the Platform as a Service (PaaS) that is provided by Oracle Cloud. In a nutshell we can define BICS in 4 steps: 1. Load Your Data - In present BICS for all kind of reporting we need to upload data to Cloud Oracle database. Sources can be files, on premise systems (local sy...

Oracle and Metadata Management

Gartner defines Metadata Management is all about the organization's management of its data and information assets. Metadata describes various facets of an information asset to improve its usability throughout its life cycle. Enterprise metadata management (EMM) encompasses the roles, responsibilities, processes, organization and technology necessary to ensure that the metadata across the enterprise adds value to an enterprise's data and information. Reducing Risk and Increasing Trust in Data Enterprise Metadata Management reduces the risk inherent in making any changes to the data in the organization, be it from the source to the final report. Where many business applications and decision systems rely on the same set of data, it is important to be able to authoritatively determine the impact that a change can affect to upstream and downstream applications. By providing in depth, easy to read and interpret lineage reports, Enterprise Metadata Management reduces these risks by...

OBIEE RPD - Physical Layer design standards

OBIEE RPD consists of three layers - Physical, Business Model & Mapping (BMM), and Presentation. Physical layer is where information about physical data sources are stored. BMM layer is where measurements and terms used in business are mapped to physical sources. Presentation layer is where business model is customized for end users. As per Oracle standards/recommendation we use Import Wizard for creating a new repository file and import metadata. Connection Type used in Import Wizards: Connection type is nothing but drivers/native Application Programming Interface (API) used for accessing different databases. Some databases may be accessed using native APIs, some using ODBC. Database Object - Connection Pool Connection pool regulate access to data source. Every data source must have at least one connection pool. Connection pool provide connections to multiple concurrent queries (data source requests) immediately because of enable connection pooling. This featu...

OBIEE initialization file

The initialization files for OBIEE are the backbone of it's functioning. There are three important initialization files present in OBIEE server location: 1. NQSConfig.ini 2. DBfeatures.ini 3. NQClusterConfig.ini (not present in OBIEE 11g replaced by xml file) NQSConfig file contains few important parameters. Understanding those parameters help in resolving few issues instantly. For example: PROJECT_INACCESSIBLE_COLUMN_AS_NULL Security enabled columns can be displayed to unauthorized users. There are two users. One having access to all the columns of a presentation table and other user has access to few columns only (unauthorized user). When a report is built on this presentation table with all the columns included, unauthorized user trying to access this report will error out due to insufficient privileges. Once this parameter is enabled (YES), unauthorized user can see this report. A NULL expression replaces the inaccessible column in the query and secured column...