
Vithoo's Classwork

Data Mining and Data Warehouse

posted Nov 29, 2010, 1:40 AM by Unknown user

Each of the following items relates to (a) Data Warehouses and (b) Data Mining.

1.        Describe what each is.

2.       Outline the benefits that can be gained by organisations that keep these

3.       Explain with examples how each could potentially be abused

4.       Using the internet, identify two companies that do each (for a total of four companies) and justify (with examples) the use of such methods for each company.

 

1.       A Data Warehouse is a database that collects information from different data sources and places it together so that relationships can be developed. It is a storage area of raw data that can be analysed to assist organisations in making decisions. Data Mining is a process that looks for relationships and patterns in the data stored in a database. It sorts data and turns up interesting and useful connections.
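As a rough sketch of the difference, the Python below first combines two made-up data sources into one list (the warehouse step) and then counts which items turn up together most often (the mining step). The records, the field names and the helper functions `build_warehouse` and `mine_pairs` are all invented for illustration; real systems use dedicated databases and far more advanced techniques.

```python
from collections import Counter
from itertools import combinations

# Hypothetical records from two separate sources (made-up data).
store_sales = [
    {"customer": 1, "items": ["bread", "milk"]},
    {"customer": 2, "items": ["bread", "butter", "milk"]},
]
online_sales = [
    {"customer": 3, "items": ["milk", "bread"]},
    {"customer": 1, "items": ["butter", "jam"]},
]


def build_warehouse(*sources):
    """Combine records from several sources into one list (the 'warehouse')."""
    warehouse = []
    for source in sources:
        warehouse.extend(source)
    return warehouse


def mine_pairs(warehouse):
    """Count how often each pair of items appears in the same record."""
    pair_counts = Counter()
    for record in warehouse:
        # Sort so that (bread, milk) and (milk, bread) count as the same pair.
        for pair in combinations(sorted(set(record["items"])), 2):
            pair_counts[pair] += 1
    return pair_counts


warehouse = build_warehouse(store_sales, online_sales)
pairs = mine_pairs(warehouse)
print(pairs.most_common(1))  # the pair that appears together most often
```

With this sample data the most common pair is bread and milk, which is the kind of "interesting connection" an organisation might act on.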

 

2.       Data Warehouses allow organisations to draw data from several sources in order to develop an accurate reading (an analysis of the situation) before a decision is made. They give access to large amounts of information, providing a broader idea of the context of any decision to be made, and allow this information to be simplified so that it can be analysed easily. Data Mining builds on this to let organisations look into possible future predictions, relationships in how the organisation works and possible improvements which could be made.

 

3.       Data Warehouses and Data Mining raise issues of privacy and ownership of data. Data Warehousing may develop into a system which sources data that is completely irrelevant to the organisation's purpose and invades another's privacy. In turn, the copyright of some data may also be breached. How an organisation conducts itself whilst developing data warehouses is hence a key issue. For example, authorities may need to know much of this data, but others with malicious intent may use it for purposes other than those related to the organisation they work for. Data Mining may again lead to relationships which deviate from the original purpose of the organisation. It can lead to the development or holding of personal data such as phone or credit card numbers, which may invade one's privacy. For example, a banker who holds customer data may mine through personal data such as addresses and phone numbers instead of customer IDs and financial situations. Furthermore, if the script of a play which is yet to be performed was mined from the data warehouse of an acting institution, data ownership would become an issue, raising the question of whether the organisation can use it.

 

4.       SMH- SMH needs to perform data warehousing in order to gather and publish all its articles on the internet as well as in the daily papers. Not only that, like all other companies it needs to research optimal timings to advertise and campaign for the purchase of its newspaper. This is essential in order to optimise customer purchases of the newspaper and hence the entire organisation's livelihood. For example, TV advertisements for the newspaper which include a special toy of some sort will require a data warehouse as well as data mining in order to discover the most suitable time for displaying children/family related advertisements, when most of the target audience is watching.

 

Google- Google continuously needs to discover ways to improve its services through continual market research, which, like SMH, it can perform by utilising data mining to identify peak hours of use as well as relationships between certain periods of time and particular searches. From this mined information, the organisation can develop and improve its current services. Furthermore, Google Maps requires a constant feed of traffic flow data updates to keep the product as accurate as possible. Accumulating such vast amounts of data from all over the world, considering the traffic of all the cities which use this service, also shows the need for data warehousing as a technique to develop this service.

 

McDonalds- McDonalds, like SMH, also needs to find ideal times to run particular commercials as well as the best placement of billboards and posters. Through the use of a data warehouse, accurate data which brings together several statistics will provide not a guess at the timing of particular advertisements but a concrete and justifiable timing for all TV and radio ads. For example, a Happy Meal TV advertisement would most likely be useful between the hours of 7-8am as well as 3-5pm, which is when most children watch TV. The placement of billboards can also be influenced by collecting data concerning the number of people who pass along a certain road as well as the times at which they pass. McDonalds can use this through data mining to understand why a breakfast deal-promoting poster should be placed in front of a highway: for example, if over 10,000 people pass through between 6-7am on the way to the city (i.e. work), that is ideal for such promotion.

 

Apple- As a result of its growth into an expansive global company, Apple is likely to have large amounts of data concerning its product development and feedback, as well as any issues with its products. In order to turn this data into meaningful information, data mining can be used to develop feedback and issue consultations more accurately. In order to continue discovering world-first breakthroughs in technology, data warehousing is necessary to keep the organisation up to date. Once again, accurate market research through data warehousing, used to confirm advertising logistics, is also possible by researching the movements of followers of technology.

Critique the Website- Point the finger

posted Oct 18, 2010, 3:59 PM by Unknown user

Ok so our website was a bit bland to an extent. I guess some of the pages were just a wall of text. As a group, we worked well, but credit for some pieces of work was not given out, especially as the interview occurred before the rush work.
 
I actually did really well, considering I had continued to sulk on my exam results =(
 
umm.. the innovate with technology was a bit bland as well according to others, but it was our group register which put us down. Next time we should be a bit more formal.

Personality Test

posted Oct 17, 2010, 4:07 PM by Unknown user

My Type Is: ENTJ
 
Extraverted: 22
Intuitive: 38
Thinking: 12
Judging: 33

My Exam

posted Sep 2, 2010, 5:49 PM by Unknown user

Here we are again... same place with the same mark. Well at least I know I tried hard, but of course I can do better. Multiple choice was bad as usual.. but my short answer was much worse.. especially towards the end. Hence, I am doubting whether I should be here.. anyways..
 
Multiple choice:
Data compression is where I lost 2 marks, and I should have been able to recognise the bias in Q20.
 
Short answer:
One of the questions asked for JUSTIFYING instead of what I did (accounting). Hence, I did not answer the question effectively and built my answer around the wrong concept. The next question is where I did not recognise the IMPLICATIONS of data ownership. The word "implications" threw me off-track, and my answer again did not revolve around the right concept. The next mistake is the one on colour, where I did not UNDERSTAND the question again. The wording of this question is what put me off (again). The last question is another example of NOT UNDERSTANDING THE QUESTIONS. I made points but did not give the reasons to back these points up.
 
To improve:
I believe that UNDERSTANDING the question was my problem. Hopefully I will do better next time. I should try to simplify things more, and obviously study LESS. haha

IPT Audacity Recordings

posted Jun 7, 2010, 5:36 AM by Unknown user

We spent today's lesson recording our voices into Audacity and adding effects to them. My original and changed files are below with the attachments:
 
  1. Original
  2. Fade in & Fade out: the volume of the mp3 file increases at the start and decreases at the end of the file. The decibels increase or decrease.
  3. Change pitch: Changed pitch to Gb/F sharp. The pitch was made higher as the frequency is higher.
  4. Change speed high: This increased the pitch and reduced the time. Some of the samples have been removed at regular intervals and the remaining samples have been shuffled back together.
  5. Change pitch low: Samples in the file have been stretched and new ones have been created.
  6. Echo: duplicates the data so it is read twice, gradually decreasing the decibels so it fades out.
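The effects above can be roughly sketched in Python by treating a recording as a plain list of sample values. This is only an illustration of the ideas in the list, not how Audacity actually implements its effects; the function names and the default `factor` and `decay` values are made up for the example.

```python
def fade_in(samples):
    """Scale samples from silence up to full volume across the clip."""
    n = len(samples)
    if n < 2:
        return list(samples)
    return [s * i / (n - 1) for i, s in enumerate(samples)]


def speed_up(samples, factor=2):
    """Keep every factor-th sample: the clip gets shorter and sounds higher."""
    return samples[::factor]


def slow_down(samples):
    """Create a new in-between sample after each original one (simple averaging)."""
    if not samples:
        return []
    out = []
    for a, b in zip(samples, samples[1:]):
        out.extend([a, (a + b) / 2])
    out.append(samples[-1])
    return out


def echo(samples, delay, decay=0.5):
    """Mix a quieter copy of the clip back in, starting 'delay' samples later."""
    out = list(samples) + [0.0] * delay
    for i, s in enumerate(samples):
        out[i + delay] += s * decay
    return out


clip = [1.0, 1.0, 1.0]
print(fade_in(clip))
print(echo([1.0, 0.0, 0.0], delay=1))
```

A fade out would be the same idea as `fade_in` with the scaling running from full volume down to silence.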

IPT H/Y Marks

posted May 9, 2010, 5:31 PM by Unknown user

Well.. disappointed is the only thing that can be said about my mark for the exam. I studied.. really hard and all, but what I learnt is that how I studied is just not what this subject needs. IPT is APPLICATION, not knowledge, unlike say Ag.
 
Where'd I go wrong? My MCQ was crap, then the system diagram was horrible too. If these were improved, my mark would have shot up quite a bit more.
 
I believe I am still on the first step of Bloom's pyramid and have a long way to go... ==

Biometric Security Card Issues

posted May 5, 2010, 4:40 PM by Unknown user

The Australian Federal Government wants to combine all personal information (e.g. passport, driver's licence, Medicare, bank account, Facebook) into one biometrically secure identity card. This would involve all data being centrally available to the government.
 
Outline all the social and ethical issues that this raises, and describe three in detail.
 
As a result of the combination of all personal information into a single biometric card, there are a variety of social and ethical issues which will be raised.
 
These include the changing nature of work. As a result of the new biometric cards, everyone will have to switch from the current passport, credit card and bank account system to these new cards. If the transition is not conducted efficiently, there will be inconvenience for much of the Australian and global public.
 
Data security will become an issue as, so far and to our knowledge, there is no proof that only the appropriate data will be processed and everything else withheld. For example, a bank transaction should not be able to pull data from your passport.
 
Privacy may also become an ethical issue brought up by this proposal. The federal government's proposal for the variety of data all to be placed on the one card can clash with privacy, as data which is not necessary for a certain transaction may be taken, such as credit card details which may be private information to the owner.

HTML

posted Mar 24, 2010, 2:06 PM by Unknown user

HTML is text interpreted by software and displayed according to the meaning of various tags.

 
 

Audio and Video Type

posted Mar 14, 2010, 4:46 PM by Unknown user

Frequency and Amplitude make up sound:
        Frequency is measured in 'hertz' (Hz)
        Amplitude is measured in 'decibels' (dB)
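To make the two measurements concrete, here is a small Python sketch. It assumes a pure sine tone, a made-up sample rate of 8000 samples per second, and the standard formula of 20 times log base 10 for turning an amplitude ratio into decibels; the function names are invented for this example.

```python
import math


def sine_samples(frequency_hz, amplitude, sample_rate=8000, count=8):
    """Sample values of a pure tone with the given frequency and amplitude."""
    return [
        amplitude * math.sin(2 * math.pi * frequency_hz * n / sample_rate)
        for n in range(count)
    ]


def amplitude_ratio_to_db(ratio):
    """Convert an amplitude ratio (e.g. 10 times louder) into decibels."""
    return 20 * math.log10(ratio)


tone = sine_samples(frequency_hz=1000, amplitude=0.5)
print(tone[:3])
print(amplitude_ratio_to_db(10))
```

Raising `frequency_hz` makes the wave repeat more often (higher pitch), while raising `amplitude` makes each sample value bigger (louder).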
 
There are two methods to represent sound:
  • Sampling: Sampling is sound recorded at precise and regular

Image Data Type

posted Mar 11, 2010, 3:34 PM by Unknown user

There are two main methods for storing image data (e.g. Screenshots, Graphics on a website, Photographs):
  1. Bitmap: in a bitmap, each individual dot is stored separately. Individual dots are called picture elements, or pixels for short. The file size of a bitmap depends on two things: the number of pixels and the number of colours. When the number of colours is reduced on a bitmap, an effect called banding is observed.
  2. Vector: rather than storing individual dots, a vector image mathematically represents the shapes and colours in an image. Vector image files are almost always vastly smaller than bitmap files.
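The link between pixels, colours and bitmap file size can be worked out with a short Python sketch. It assumes an uncompressed bitmap and ignores any file header, so real files will differ a little; the function names and the 800 by 600 example are made up for illustration.

```python
def bits_per_pixel(colours):
    """Smallest whole number of bits that can give each colour its own code."""
    return max(1, (colours - 1).bit_length())


def bitmap_bytes(width, height, colours):
    """Approximate size in bytes of an uncompressed bitmap (no header)."""
    total_bits = width * height * bits_per_pixel(colours)
    return (total_bits + 7) // 8  # round up to whole bytes


print(bitmap_bytes(800, 600, 256))  # 8 bits per pixel
print(bitmap_bytes(800, 600, 16))   # 4 bits per pixel, so half the size
```

Halving the bits per pixel halves the file size, which is why reducing the number of colours saves space at the cost of banding.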
