Data Quality Mining and Text Discussion

Content Type

User Generated

User

fnawnl9491

Subject

Business Finance

Description

Question

Data Management

After studying this week’s assigned readings, discuss the following:

1. What are the business costs or risks of poor data quality? Support your discussion with at least 3 references.
2. What is data mining? Support your discussion with at least 3 references.
3. What is text mining? Support your discussion with at least 3 references.

Answer: Answer all 3 questions. Use APA format, with more than 350 words and at least 3 references.

1. Response by: Kiranmai

Data quality used to be confined to CRM systems. Today it extends well beyond managed customer data. To begin improving data quality, you need to dig in and identify exactly what causes bad data.

What is data mining?

Data mining is defined as the computational process of analyzing large amounts of data in order to extract patterns and useful information. Over the last few decades, data mining has been widely recognized as a powerful yet versatile data analysis tool in a variety of fields: information technology first of all, but also clinical medicine, sociology, and physics. In this technical note we give a high-level overview of the most prominent tasks and methods that form the basis of data mining.
The note also focuses on some of the most recent yet promising interdisciplinary aspects of data mining.

• transformation, which is in charge of reducing and projecting the data in order to derive a representation suitable for the specific task to be performed; it is typically accomplished with transformation techniques or methods that can find invariant representations of the data;
• data mining, which deals with extracting interesting patterns by choosing (i) a specific data mining method or task (e.g., summarization, classification, clustering, regression, etc.), (ii) proper algorithm(s) for performing the task at hand, and (iii) an appropriate representation of the output results;
• interpretation/evaluation, which is used by the user to interpret and extract knowledge from the mined patterns; this interpretation is typically carried out by visualizing the patterns, the models, or the data given such models and, if needed, iteratively looking back at the previous steps of the process.

Due to the peculiarity of the underlying data, it is clear that data analysis in such challenging settings cannot be performed with traditional data analysis techniques, either manual or automated. Data mining aims at filling this gap, with its inherently interdisciplinary nature placing it at the intersection of several more classical fields, such as artificial intelligence, statistics, database systems, and machine learning.

What is text mining?

Text mining involves preparing unstructured data, deriving meaningful numeric indices from the text, and making sure the information contained in the text is accessible. The information stored in the text should be readily accessible so that it can be used for analysis. In essence, text mining turns text into numbers.
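To make the idea of turning text into numbers concrete, here is a minimal bag-of-words sketch. It is only an illustration under simple assumptions: the function names `tokenize` and `bag_of_words` and the two sample sentences are hypothetical, and real text mining tools usually add steps such as stop-word removal and weighting.

```python
from collections import Counter
import re


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts word j in document i."""
    tokenized = [tokenize(doc) for doc in documents]
    # The vocabulary is every distinct word seen in any document, sorted.
    vocabulary = sorted({word for tokens in tokenized for word in tokens})
    matrix = []
    for tokens in tokenized:
        counts = Counter(tokens)
        matrix.append([counts[word] for word in vocabulary])
    return vocabulary, matrix


# Hypothetical sample documents, used only for illustration.
docs = ["Data quality matters", "Poor data, poor decisions"]
vocab, counts = bag_of_words(docs)
print(vocab)   # column labels
print(counts)  # one row of word counts per document
```

Each document becomes a row of word counts, which is the kind of numeric representation that data mining methods can then work on.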
Given a set of data objects, clustering aims at identifying a finite set of groups of objects, i.e., clusters, so that the objects within the same cluster are "similar" to each other, while the objects belonging to different clusters are "dissimilar". The degrees of (dis)similarity among data objects are computed and evaluated by a proximity measure that can be either specified by the user or inherently incorporated in the specific clustering algorithm. In a clustering task there is no prior knowledge of the class labels associated with the objects to be grouped; for this reason, clustering is often also referred to as unsupervised classification, to emphasize the difference from the (supervised) classification task, in which the class labels of the objects in the training set are known. A clustering of the input set of objects is therefore built so that cluster cohesiveness and separation, measured with respect to the underlying proximity measure, are maximized. More precisely, clustering methods typically define a specific objective function to be optimized, in order to formally define clusters that are compact and well-separated from each other. Since these formulations generally lead to computational problems too hard to be solved optimally for large-scale inputs (the so-called NP-hard problems), a specific clustering method should define the corresponding approximation/heuristic algorithm(s) to find good approximations of the optimal solution.

Answer: Please add your comments on whether you accept or disagree with the response, depending upon the question, in more than 300 words. Below are additional suggestions on how to respond to your classmates’ discussions:

· Ask a probing question, substantiated with additional background information, evidence or research.
· Share an insight from having read your colleagues’ postings, synthesizing the information to provide new perspectives.
· Offer and support an alternative perspective using readings from the classroom or from your own research.
· Validate an idea with your own experience and additional research.
· Make a suggestion based on additional evidence drawn from readings or after synthesizing multiple postings.
· Expand on your colleagues’ postings by providing additional insights or contrasting perspectives based on readings and evidence.

2. Response by: Asrar

A business is an activity carried out to make money by buying or selling goods or services; put simply, it is an enterprise run for profit. Being an enterprise does not necessarily mean being a company: a business can also be a partnership or a corporation. The owner can share ideas with partners, but the owner remains responsible for the entire business. When the owner holds the business personally, the owner is personally answerable to creditors, and the income of the business is taxed personally rather than under a separate tax structure. A business can also be set up as a company, which is a separate legal entity that provides liability protection, but this is more complicated and expensive for the owner to set up (McCann, 2010). Some leaders treat data analysis as a top-priority capability to invest in, while the quality of business data is often disregarded in decisions about technologies. Some employees raise issues with executives about how to analyze different situations for different reasons, including what information the company holds and how to secure the data the company legally possesses (Han, 2011).
Employees face challenges in improving data quality because the cost of producing good-quality data obscures its advantages; low-quality data can have a financial impact on the business and lead to a loss of reputation. Data analytics may be compared to a refinement process that is applied around the world, and many countries are trying to provide information about how data can be delivered (Miner, 2012).

Answer: Please add your comments on whether you accept or disagree with the response, depending upon the question, in more than 300 words. Below are additional suggestions on how to respond to your classmates’ discussions:

· Ask a probing question, substantiated with additional background information, evidence or research.
· Share an insight from having read your colleagues’ postings, synthesizing the information to provide new perspectives.
· Offer and support an alternative perspective using readings from the classroom or from your own research.
· Validate an idea with your own experience and additional research.
· Make a suggestion based on additional evidence drawn from readings or after synthesizing multiple postings.
· Expand on your colleagues’ postings by providing additional insights or contrasting perspectives based on readings and evidence.

Explanation & Answer

Running head: DATA QUALITY, DATA MINING, AND TEXT MINING

Data Quality, Data Mining, and Text Mining
Name
Institutional Affiliation
Date

Data quality
Data quality refers to the ability of data to serve its purpose in a given context (Hazen,
Boone, Ezell, & Jones-Farmer, 2014, p. 72). Data quality is determined by various factors,
including reliability, accuracy, completeness, and relevance. Having proof of data quality is very
important in that it promotes customers' trust in an organization (Kwon, Lee, & Shin,
2014). Proof of data quality, on the other hand, brings additional costs and risks, such as
substantial variation in which one enterprise benefits while others do not. The additional costs
arise because the proof of data quality may indicate that some areas require more attention,
resulting in the neglect of other areas (Cai & Zhu, 2015). Cai and Zhu (2015) also assert that
producing quality data requires a highly skilled workforce, which might be very expensive to
acquire. In addition to substantial variation and cost, proof of data quality may also carry the
risk of incorrect predictions. From the available data, managers may make decisions concerning
the future of their organizations, decisions that are at risk of being invalidated should the
dependent variables change (Cai & Zhu, 2015). Also, proof of data quality may undermine
decisions made by experienced managers that differ from the proof, which is a challenge because
these managers' decisions may be the correct ones in the long run (Cai & Zhu, 2015).
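As an illustration of one of the quality factors named above, the following sketch scores completeness as the share of required values that are actually filled in. The function name `completeness`, the field names, and the customer records are all hypothetical and are not drawn from the cited sources; other factors, such as accuracy or relevance, would need their own checks.

```python
def completeness(records, required_fields):
    """Fraction of required field values that are present (not None or empty)."""
    total = len(records) * len(required_fields)
    if total == 0:
        return 1.0  # nothing is required, so nothing is missing
    filled = sum(
        1
        for record in records
        for field in required_fields
        if record.get(field) not in (None, "")
    )
    return filled / total


# Hypothetical customer records with two missing values.
customers = [
    {"name": "Ann", "email": "ann@example.com", "age": 34},
    {"name": "Bob", "email": "", "age": 41},
    {"name": "Cy", "email": "cy@example.com", "age": None},
]
score = completeness(customers, ["name", "email", "age"])
print(f"Completeness: {score:.0%}")
```

A score like this gives managers a simple number to track over time, though it says nothing about whether the filled-in values are correct.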
Data mining
Data mining is the extraction of non-trivial, implicit, previously unknown, and potentially useful
patterns or knowledge from vast amounts of data (Aggarwal, 2015). Data mining, according to
Aggarwal (2015), is the analysis step in Knowledge Discovery in Databases (KDD). According
to Zaki, Meira Jr, and Meira (2014), the term data mining was introduced in the 1900s, while
older methods of identifying patterns in data included Bayes' theorem in the 1700s
and regression analysis in the 1800s. This section highlights the main research areas
of data mining and the kinds of data that data mining is performed on, explains the main
functionality and processes in data mining, and states the major issues in data mining.
According to Aggarwal (2015), data mining is widely applicable and can be used for market
analysis and management, risk analysis and management, fraud detection, and detection of
unusual patterns. Given these broad applications, as Aggarwal (2015) further asserts, the
main research areas of data mining are in medicine and manufacturing engineering. Data mining
is usually performed on data stored in relational databases, data warehouses, transactional
databases, and other advanced databases and information repositories, such as time-series data,
spatial and temporal data, and stream data (Roiger, 2017).
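The detection of unusual patterns mentioned above can be sketched with a simple z-score rule, which flags values that lie far from the mean. This is only one basic technique and is not taken from the cited sources; the function name `flag_outliers`, the threshold of 2.0, and the transaction amounts are assumptions chosen for illustration.

```python
import statistics


def flag_outliers(values, threshold=2.0):
    """Return the values whose z-score magnitude exceeds the threshold."""
    mean = statistics.mean(values)
    spread = statistics.pstdev(values)  # population standard deviation
    if spread == 0:
        return []  # all values are identical, so none stands out
    return [v for v in values if abs(v - mean) / spread > threshold]


# Hypothetical transaction amounts; one is far larger than the rest.
amounts = [20, 22, 19, 21, 23, 20, 500]
suspicious = flag_outliers(amounts)
print(suspicious)
```

In a fraud detection setting, the flagged amounts would be candidates for manual review rather than proof of fraud.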
Data mining has many functionalities, including classification and prediction through the
construction of models that describe and distinguish classes and concepts for future projections;
association, which involves correlation and causality; cluster analysis; outlier analysis; trend
analysis and...