A random variable $X$ is called continuous if its set of possible values is uncountable, and the chance that it takes any particular value is zero ($\text{P}(X = x) = 0$ for every real number $x$). A random variable is continuous if and only if its cumulative probability distribution function is a ...

0
votes
1answer
18 views

Expected value of continuous probability distribution

I'm using statsmodels.nonparametric.KDEMultivariate to generate continuous probability distributions with kernel density estimation. The distribution is created ...
2
votes
1answer
21 views

Family distribution for continuous count data

I need to model the variable Total Motile Count, which describes how many million sperm cells in an entire ejaculate are motile. It is not a proper count since it is calculated as a product of other ...
0
votes
0answers
23 views

Choosing between independent t-test and multinomial regression

I have two variables: one dichotomous variable (0-1) and one continuous variable (utilities from a conjoint study). I now want to measure how the continuous influences the utilities. In other words: do ...
0
votes
0answers
16 views

Significance Testing of Means

How do I calculate the sample size required to design a test to assess lift in sales made by test and control groups? I know actual significance study will be based on actual means and standard ...
2
votes
0answers
39 views

Artificial Neural Network with continuous and binary variables

I have a dataset with numerical (continuous) and categorical variables. I want to fit an artificial neural network. To do so, I have transformed my categorical variables by using the 1-of-k method, so ...
4
votes
2answers
308 views

Common Continuous Distributions with [0,1] support

Question I am looking to understand what possible common statistical continuous distributions exist with support [0,1]. Background In my work I often come across data which are bounded between 0 ...
2
votes
0answers
24 views

Moving from discrete sum of changes to continuous integral of local covariance - how is this done?

I'm trying to derive a specific result about the relationship between forwards and futures. The expression is from the paper, "The relationship between forward and futures prices", written in 1981 ...
3
votes
1answer
35 views

Understanding how to apply goodness of fit tests when parameters of a continuous non-normal distribution have been fitted to the data

I have some data and wish to fit several distributions to it, many of which are compound and/or complex. I'd like to know whether a given family of distributions is appropriate which I can apparently ...
0
votes
0answers
11 views

Goodness of fit for point processes instead of continuous processes

Why standard distance discrepancy measures applied in continuous data analyses, such as the average sum of squared deviations between recorded data values and estimated values from the model, cannot ...
1
vote
0answers
9 views

Clustered analysis with mclust and various type of variables [migrated]

After several hours spent on various blogs, I decided to explicitly ask my question on this website. I am trying to cluster my data with different methods. I have already done the work with hierarchical ...
0
votes
1answer
18 views

Binary and continuous variable conversion for neural network analyses

I am a little confused about how to handle binary variables and continuous variables before being fed into a neural network in R. Please can you confirm that I should normalize all variables to fall ...
0
votes
0answers
27 views

Regression Options With Categorical X values and continuous data

I am visualizing average biological parameters (i.e., weights, lengths, age, condition, etc.) over time (specifically across years) using geom_boxplot in ggplot in R. I have also fit a trend line ...
0
votes
0answers
15 views

Discrete and Continuous Components of a Distribution Function

Can anyone provide me with any hint(s) as to how can I find the discrete and continuous components of the following distribution function, F(x) ? $$F(x) = 0, \;\;\;\;x < 1 \\ = \frac{1}{2} +...
1
vote
0answers
15 views

How to examine the relationship between a multiple response categorical variable and a continuous outcome?

I have a questionnaire that asked people to endorse the top three types of internet based activity they engage in (binary yes/no for each category of activity, 22 categories). I want to examine ...
0
votes
1answer
54 views

Treating continuous variables

I attended a conference on ML and Data Science and I have a general question that was not answered in the conference. If we have a continuous variable, let's say age. What is the best way to handle ...
1
vote
0answers
37 views

When does a continuous distribution become a discrete one?

If I have a continuous probability distribution, say normal, it's clearly theoretically continuous and values can be anywhere within its range (+/- infinity I guess to infinite precision). I then ...
0
votes
0answers
16 views

Mutual information for numerical data

My question is whether it is possible to compute the mutual information I(A;B) between two numerical attributes (A and B contain real data). I can't find a solution for whole ...
-3
votes
1answer
44 views

Linear regression with two categorical independent variables and a continuous dependent variable

I measured the amount of inflammation among patients as a categorical variable (mild/moderate/strong), I also measured the amount of disability of the patients using a questionnaire (score 0-23) 8 ...
0
votes
1answer
30 views

Can an ordinal variable such as Bond Rating reasonably be regressed against a continuous variable such as rate of return?

I encountered an analysis in which the analyst claims to have regressed ordinal values for bond ratings assigned on a scale of 1 to 21 with 1 representing the highest rated bonds and 21 representing the ...
1
vote
0answers
17 views

Estimating Continuous latent variables in a general Bayesian network

I am working on a problem where every node in the Bayesian network is a continuous random variable and the structure of the network is known. This network comprises both observed and latent nodes, ...
1
vote
1answer
35 views

How to calculate likelihood for simple linear regression, why it is not zero?

I am confused with the likelihood for simple linear regression, in this note it says $$ \large \prod_{i=1}^n p(y_i \mid x_i; \beta_0,\beta_1,\sigma^2) = \prod_{i=1}^n \frac 1 {\sqrt{2\pi\sigma^2}} e^{...
0
votes
1answer
57 views

Regression random forest and highly skewed response distribution

There is a great deal of information on how unbalanced data sets may impact predictive accuracy in classification problems. Several solutions have been proposed (see here). My questions are: Can a ...
0
votes
0answers
27 views

Clustering mixed dependent variables

What is a suitable algorithm for clustering variables that are both continuous and categorical? I would use two-step clustering, but since it has a dependent variable, it is not suitable.
2
votes
3answers
81 views

Integer Data: Categorical or Continuous?

I am wondering if integer predictor data should be treated as categorical (thus requiring encoding) or continuous. For example, if the range of a given predictor X ...
0
votes
1answer
79 views

Should I delete one year with small sample size from time series analysis?

I hope you can help me with this question: I have time series data (25 years) that I will analyze to find temporal changes in seasonality. I am using linear regression and my model ...
1
vote
0answers
26 views

Show probability that P(N > k) = 1/k for independent and identically distributed continuous random variables [closed]

Let $X_{1}, X_{2}, ...$ be independent and identically distributed continuous random variables. Let $N$ be the smallest value of n for which $X_{n} > X_{1}$. a) Show that $P(N > k) = \frac{1}{k}...
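
A quick numerical sanity check of part (a), offered as a minimal sketch and not as a proof: the event $N > k$ occurs exactly when none of $X_2, \dots, X_k$ exceeds $X_1$, so its frequency can be estimated by simulation. The function name `estimate_p_n_greater_than_k`, the trial count, the seed, and the choice of Uniform(0, 1) draws are all assumptions made for illustration; only the ordering of the draws matters here.

```python
import random


def estimate_p_n_greater_than_k(k, trials=100_000, seed=0):
    """Monte Carlo estimate of P(N > k).

    N is the smallest n >= 2 with X_n > X_1, so N > k means that
    none of X_2, ..., X_k exceeds X_1.
    Assumption: the X_i are drawn from Uniform(0, 1).
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        x1 = rng.random()
        # Check the k - 1 draws that follow X_1; stop at the first one above it.
        if all(rng.random() <= x1 for _ in range(k - 1)):
            hits += 1
    return hits / trials


if __name__ == "__main__":
    for k in (1, 2, 5, 10):
        print(k, estimate_p_n_greater_than_k(k), 1 / k)
```

The estimates should sit close to $1/k$, which matches the symmetry argument that each of the first $k$ draws is equally likely to be the largest.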
0
votes
1answer
55 views

Making discrete data “continuous” for time series analysis

I would like to make a time series model to forecast the number of reported car crashes in my local area. The data I have available is a log of when the accident was reported to the local dispatch ...
0
votes
0answers
15 views

What statistical test for an A/B split using revenue and impressions? (Continuous Vars.)

I want to conduct an A/B test between two different Ad Copies I have for a given advertisement on Google. Ad Copy A might have 30 k impressions (number of times ad was displayed) with approximately $...
0
votes
1answer
23 views

Test between a Yes/No independent variable and a continuous Age variable?

Sorry if this is very easy but I have been struggling all day. I have some data where I know the ages of 16 subjects: 3,3,4,4,4,5,5,5,5,5,5,5,6,6,7,9 and the corresponding answer: Y,Y,Y,N,N,Y,Y,Y,...
1
vote
0answers
32 views

What statistical test? DV: Categorical IV: Continuous

This is for a university essay. The data that I would like to compare are General Self Rated Attractiveness (GSA) and Sexual Orientation Index (SOI). There are 203 individuals, each with an SOI ...
0
votes
0answers
51 views

Categorical Model: Ordinal, Nominal and Continuous Independent and Ordinal 3 option dependent

I have a large data set ($n=1100$) where my dependent variable is ordinal with 3 options that I've recoded in the following order: excited; mixed feelings; wish I didn't. My research question is as ...
0
votes
2answers
40 views

Expectation of 2 functions with one random variable

This may be a trivial question but I want to consult with you all. Let U be a continuous random variable taking values in the interval [0,2pi]. Let X = cos(U), Y = sin(U). Determine the Pearson ...
0
votes
1answer
27 views

How to set up ANCOVA with two categorical variables?

I have a dataset of geese body masses in two locations over a 100 day study period. I am interested in examining how mean body mass changes over the study period and if there is a difference between the ...
0
votes
0answers
15 views

Sum of squared error and point process data

One standard distance measure applied in continuous data analysis is the average sum of squared errors. However, this method cannot be applied to point process data. And there is an alternative solution ...
0
votes
1answer
28 views

Calculating agreement between 3 users with continuous data

I have a dataset consisting of volume measurements as taken by 3 users in two trials. That is to say, each user rated each point twice, therefore I have 6 measurements in total for each point. I ...
1
vote
1answer
33 views

Joint PDF of 2 continuous dependent RV

I am trying to calculate mutual information on two observed continuous variables X and Y, which I believe to be dependent. The formula relies on p(x,y): the joint probability density function of X ...
0
votes
0answers
36 views

Continuous data discretization rules

I know it all depends on the data, but I am looking for general, commonly used rules for continuous data discretization. For example, it could be a list like this: Use Supervised MDL discretization (...
0
votes
0answers
11 views

Interpretation of an odds ratio with continuous IV [duplicate]

I am looking at the relationship between tenure and turnover. I did not do the analysis and do not have access to the original output or data. The only information I have is this: with each more year ...
2
votes
0answers
27 views

Continuous entropy comparison

I have a continuous time signal (speech signal) and I will add noise to it at different SNRs. I want to compare the entropy of the original signal (clean speech) with the noisy ones. The idea is to ...
0
votes
0answers
47 views

Proportional dependent variable [0,1]: continuous versus count distribution

I want to model a proportional variable bounded by [0,1] (the % of land fertilized). A high percentage of the data contains 0s, a smaller percentage contains 1s, and all the rest falls in between. My ...
1
vote
3answers
78 views

Understanding symmetry of i.i.d. continuous random variables

I am reading Introduction to Probability by Joseph K. Blitzstein and Jessica Hwang, which states that continuous r.v.s that are i.i.d. have all possible rankings equally likely. In the proof, it is ...
0
votes
0answers
22 views

Interpretation: two variables interacting with the same moderator

In a data-set the variables are as follows: X1: Independent Variable 1 (continuous) X2: Independent Variable 2 (continuous) M: Moderator (Categorical, 2 Levels: I and U) Y: Dependent Variable ...
1
vote
1answer
42 views

How to choose analyzing method using discrete and continuous variables?

This is my first work, doing it for university. I have nominal, ordinal and numeric (or what they're called in English) variables. I think I finally figured out I have to name them discrete and ...
0
votes
0answers
19 views

Independent numeric variable with two levels

I am studying whether two independent variables (frequency $freq$, and time $t$) influence a (continuous) output variable ($HR$). The independent variables are numeric. However, the experiment tests ...
1
vote
0answers
20 views

Test difference between totals over periods of different length

I have totals for two periods of different lengths (previous month vs month-to-date) and want to test whether they are significantly different. E.g. 1.-30. Nov 2016 = 102.3 1.-13. Dec 2016 = 41 If ...
1
vote
0answers
11 views

Which test to use for associations between 1 IV (attribute nominal) and 1 DV (continuous)?

The assumption is there is no relationship between organizational industry association (independent nominal) and technology adoption rates (months, dependent continuous). What test to use to test this ...
1
vote
0answers
40 views

Meta analysis of continuous outcome but different follow up time

I am looking at different studies and the outcome is a percent reduction from baseline. The studies all have different follow up times. If I simply control for follow up time, that may be biased since ...
0
votes
0answers
54 views

Cross Validation Split Data to test, train and validation datasets + Discretization

I need advice on which portion of a dataset should be used to calculate cuts for discretization. I use two levels of Cross Validation. One is external to model creation, but the second is used ...
0
votes
0answers
32 views

How to find the value for which my continuous variable stops being significant?

I have a continuous distance measure as an explanatory variable (a property's distance to an amenity, ranging from 0 to 60 km). How, if possible at all, can I find the value for which distance to an ...
0
votes
1answer
16 views

Expressing a regression estimate when predictor is continuous and between 0 and 1

I am regressing a count variable (adverse consequences) against a continuous predictor variable between 0 and 1 where all values are between 0 and 0.5 (blood alcohol). I am using a hierarchical ...