Data Mining and Neural Networks Computational Task 1 | University of Leicester

Category Assignment Subject Computer Science
University University of Leicester Module Title Data Mining and Neural Networks

Your work should answer the question: Does the psychological predisposition to drug consumption exist?

Nowadays, after many years of research and development, psychologists have largely agreed that the personality traits of the modern Five Factor Model (FFM) constitute the most comprehensive and adaptable system for understanding human individual differences. The FFM comprises Neuroticism (N), Extraversion (E), Openness to Experience (O), Agreeableness (A), and Conscientiousness (C).

The five traits can be summarized thus:

  • N Neuroticism is a long-term tendency to experience negative emotions such as nervousness, tension, anxiety and depression (associated adjectives: anxious, self-pitying, tense, touchy, unstable, and worrying);
  • E Extraversion is manifested in characters who are outgoing, warm, active, assertive, talkative, and cheerful; these persons are often in search of stimulation (associated adjectives: active, assertive, energetic, enthusiastic, outgoing, and talkative);
  • O Openness to experience is associated with a general appreciation for art, unusual ideas, and imaginative, creative, unconventional, and wide interests (associated adjectives: artistic, curious, imaginative, insightful, original, and wide interest);
  • A Agreeableness is a dimension of interpersonal relations, characterized by altruism, trust, modesty, kindness, compassion and cooperativeness (associated adjectives: appreciative, forgiving, generous, kind, sympathetic, and trusting);
  • C Conscientiousness is a tendency to be organized and dependable, strong-willed, persistent, reliable, and efficient (associated adjectives: efficient, organised, reliable, responsible, and thorough).

Two additional personality characteristics have proven important for the analysis of substance use: Impulsivity (Imp) and Sensation-Seeking (SS).

  • Imp Impulsivity is defined as a tendency to act without adequate forethought;
  • SS Sensation-Seeking is defined by the search for experiences and feelings that are varied, novel, complex and intense, and by the readiness to take risks for the sake of such experiences.

In total, seven psychological traits were used to characterise the participants: N, E, O, A, C, Imp, and SS.

Task 0. Preparing data for analysis

The dataset is available online at
https://leicester.figshare.com/articles/dataset/Drug_consumption_database_quantified_categorical_attributes/7588409
The database description is available at
https://leicester.figshare.com/articles/dataset/Drug_consumption_database_description/7588412

There are many more attributes than you need. Prepare the table. For every participant, keep the following information: the 7 psychological traits and nicotine user/non-user status (in the last year).
 
The user/non-user classification will be the main task.
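The preparation step can be sketched as follows. This is a minimal sketch, not the prescribed procedure: the column names (`N`, `E`, ..., `Nicotine`) are hypothetical, the sample values are made up, and it assumes that nicotine use is coded with the classes CL0 to CL6, where CL3 to CL6 mean use within the last year. Check both assumptions against the database description and adjust the names and the set of user classes if the file codes them differently.

```python
# Hypothetical column names; check them against the database description.
TRAITS = ["N", "E", "O", "A", "C", "Imp", "SS"]

# Assumption: classes CL3..CL6 mean "used within the last year".
LAST_YEAR_USER = {"CL3", "CL4", "CL5", "CL6"}


def prepare_table(rows, nicotine_col="Nicotine"):
    """Keep the 7 trait scores and a boolean user flag for every participant."""
    table = []
    for row in rows:
        features = [float(row[name]) for name in TRAITS]
        is_user = row[nicotine_col] in LAST_YEAR_USER
        table.append((features, is_user))
    return table


# Two made-up participants, only to show the shape of the input and output.
sample_rows = [
    {"ID": 1, "Age": 0.5, "N": 0.31, "E": -0.58, "O": -0.58, "A": -0.92,
     "C": -0.01, "Imp": -0.22, "SS": -1.18, "Nicotine": "CL2"},
    {"ID": 2, "Age": -0.1, "N": -0.68, "E": 1.94, "O": 1.44, "A": 0.76,
     "C": -0.14, "Imp": -0.71, "SS": -0.22, "Nicotine": "CL4"},
]

table = prepare_table(sample_rows)
for features, is_user in table:
    print(features, "user" if is_user else "non-user")
```

Every extra attribute (here `ID` and `Age`) is dropped, and each participant is reduced to a feature vector plus a class label.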

Task 1. Descriptive statistics 

For both classes (users and non-users), find the mean values of the 7 attributes and their standard deviations. Evaluate the 95% confidence intervals for the mean values. (Take the definitions from any elementary textbook in statistics.)

A very simple online tutorial about the 95% confidence interval is here: http://www.itl.nist.gov/div898/handbook/eda/section3/eda352.htm

A very simple textbook, The Little Handbook of Statistical Practice, is here: https://forum.disser.ru/index.php?act=attach&type=post&id=638

Create a graphical illustration (“psychological profiles” of nicotine users and non-users with confidence intervals).
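The descriptive statistics for one attribute in one class can be sketched as below. The function name `mean_ci95` and the example scores are illustrative only. The interval uses the normal quantile 1.96, which is an assumption that suits large samples; for small samples a Student t quantile would be the more careful choice.

```python
import math
import statistics


def mean_ci95(values):
    """Return the mean, sample standard deviation and 95% CI of the mean."""
    n = len(values)
    m = statistics.mean(values)
    s = statistics.stdev(values)          # sample standard deviation (n - 1)
    half_width = 1.96 * s / math.sqrt(n)  # normal approximation
    return m, s, (m - half_width, m + half_width)


# Made-up scores for one attribute in one class.
scores = [1.0, 2.0, 3.0, 4.0, 5.0]
m, s, ci = mean_ci95(scores)
print(f"mean={m:.3f} sd={s:.3f} CI=({ci[0]:.3f}, {ci[1]:.3f})")
```

Running this for each of the 7 attributes and both classes gives the numbers needed for the profile plot, where each mean is drawn with its interval as an error bar.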

Task 2. Significance of differences 

Report which differences between the means for users and non-users are significant. Use p-values to evaluate significance.
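One possible way to obtain a p-value for the difference of two means is sketched below. The brief does not name a test, so the choice here is an assumption: a two-sample test with unequal variances, with the p-value taken from the standard normal distribution. That approximation is reasonable only when both classes are large; for small samples the t distribution should be used instead. The function name and the data are illustrative.

```python
import math
from statistics import NormalDist, mean, variance


def mean_difference_test(a, b):
    """Two-sided large-sample test for equality of two means.

    Returns the z statistic and the p-value under the normal approximation.
    """
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    z = (mean(a) - mean(b)) / se
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return z, p


# Made-up scores: the second group is shifted upward by 5.
group_a = [0.1 * i for i in range(50)]
group_b = [5.0 + 0.1 * i for i in range(50)]

z_same, p_same = mean_difference_test(group_a, group_a)
z_diff, p_diff = mean_difference_test(group_a, group_b)
print(p_same, p_diff)
```

A difference is usually reported as significant when the p-value falls below a chosen level such as 0.05.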

Task 3. One attribute classifier 

Try to create user/non-user predictors based on one attribute each (7 such predictors). For this purpose, create histograms for each attribute and each class and select the best threshold a for each attribute x for the decision rule: if x > a then one class (users or non-users) and if x < a then the other class (non-users or users) (the optimal cut). Find the classification error for each attribute. Which attribute gives the best prediction? Rank the attributes by their predictive ability.
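The search for the optimal cut can be sketched as an exhaustive scan over candidate thresholds. This is a minimal sketch under stated assumptions: candidates are the midpoints between consecutive distinct values plus one value below the minimum (so that the trivial rule is also considered), both orientations of the rule are tried, and the error is measured on the same data used to choose the threshold, which makes it optimistic. The function name and the data are illustrative.

```python
def best_threshold(values, labels):
    """Return (a, users_above, error) for the best one-attribute rule.

    users_above=True means: x > a -> user, otherwise non-user.
    users_above=False means the opposite orientation.
    """
    xs = sorted(set(values))
    candidates = [xs[0] - 1.0]  # everything falls above this cut
    candidates += [(lo + hi) / 2.0 for lo, hi in zip(xs, xs[1:])]
    n = len(values)
    best = None
    for a in candidates:
        # Errors of the rule "x > a -> user".
        wrong = sum((x > a) != y for x, y in zip(values, labels))
        for users_above, errors in ((True, wrong), (False, n - wrong)):
            error = errors / n
            if best is None or error < best[2]:
                best = (a, users_above, error)
    return best


# Made-up attribute values; True marks a user.
values = [0.1, 0.4, 0.35, 0.8, 0.9, 0.7]
labels = [False, False, False, True, True, True]
a, users_above, error = best_threshold(values, labels)
print(a, users_above, error)
```

Applying the function to each of the 7 attributes and sorting by the returned error gives the ranking asked for in the task.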

Task 4. kNN classifier

Test 1NN and 3NN classification rules. Present the classification errors. Which rule is better?
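A kNN rule with Euclidean distance and majority voting can be sketched as below. The brief does not fix an evaluation protocol, so leave-one-out testing is used here as an assumption; a separate test set or cross-validation would also be defensible. Function names and data are illustrative.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, x, k):
    """Predict the label of x by majority vote among its k nearest neighbours."""
    order = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], x))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def leave_one_out_error(xs, ys, k):
    """Fraction of points misclassified when each is predicted from the rest."""
    errors = 0
    for i in range(len(xs)):
        rest_x = xs[:i] + xs[i + 1:]
        rest_y = ys[:i] + ys[i + 1:]
        if knn_predict(rest_x, rest_y, xs[i], k) != ys[i]:
            errors += 1
    return errors / len(xs)


# Made-up two-dimensional points forming two well-separated groups.
points = [[0, 0], [0, 1], [1, 0], [1, 1], [5, 5], [5, 6], [6, 5], [6, 6]]
classes = [False] * 4 + [True] * 4

error_1nn = leave_one_out_error(points, classes, k=1)
error_3nn = leave_one_out_error(points, classes, k=3)
print(error_1nn, error_3nn)
```

With two classes, an odd k avoids tied votes. The seven traits should be on comparable scales before distances are computed, otherwise one attribute can dominate.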

Task 5. Fisher’s linear discriminant description 

Find a description and explanation of Fisher’s linear discriminant in the literature. Read and understand it, then write a comprehensive description of the algorithm with the main formulas and an explanation (not more than 1 page!).

Task 6. Fisher’s linear discriminant usage 

Apply Fisher’s linear discriminant to the prepared data set. Analyse the quality of classification. Compare to 1NN and 3NN methods.
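A two-class Fisher discriminant can be sketched with the standard library alone. The direction is taken as the solution w of S_w w = m1 - m0, where S_w is the sum of the within-class scatter matrices and m0, m1 are the class means. Placing the decision threshold at the midpoint of the projected class means is an assumption made here for simplicity; other choices (for example, the cut that minimises the training error) are possible. The sketch also assumes S_w is non-singular. All names and data are illustrative.

```python
def mean_vector(xs):
    return [sum(col) / len(xs) for col in zip(*xs)]


def scatter_matrix(xs, m):
    """Sum of outer products of (x - m) over all points in one class."""
    d = len(m)
    s = [[0.0] * d for _ in range(d)]
    for x in xs:
        diff = [xi - mi for xi, mi in zip(x, m)]
        for i in range(d):
            for j in range(d):
                s[i][j] += diff[i] * diff[j]
    return s


def solve(a, b):
    """Solve a w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * w[j] for j in range(i + 1, n))
        w[i] = (m[i][n] - tail) / m[i][i]
    return w


def project(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def fit_fisher(x0, x1):
    """Return (w, threshold); a point x is assigned to class 1 if w.x > threshold."""
    m0, m1 = mean_vector(x0), mean_vector(x1)
    s0, s1 = scatter_matrix(x0, m0), scatter_matrix(x1, m1)
    sw = [[p + q for p, q in zip(r0, r1)] for r0, r1 in zip(s0, s1)]
    w = solve(sw, [q - p for p, q in zip(m0, m1)])
    threshold = (project(w, m0) + project(w, m1)) / 2.0  # midpoint: an assumption
    return w, threshold


# Made-up points: class 0 (non-users) and class 1 (users).
non_users = [[0, 0], [1, 0], [0, 1], [1, 1]]
users = [[4, 4], [5, 4], [4, 5], [5, 5]]
w, threshold = fit_fisher(non_users, users)
mistakes = sum(project(w, x) > threshold for x in non_users)
mistakes += sum(project(w, x) <= threshold for x in users)
error_rate = mistakes / (len(non_users) + len(users))
print(w, threshold, error_rate)
```

For a fair comparison with 1NN and 3NN, the error should be estimated with the same protocol for all three methods rather than on the training data as in this toy example.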
