A random variable $X$ is called continuous if its set of possible values is uncountable, and the chance that it takes any particular value is zero ($\text{P}(X = x) = 0$ for every real number $x$). A random variable is continuous if and only if its cumulative probability distribution function is a ...
0
votes
1answer
18 views
Expected value of continuous probability distribution
I'm using statsmodels.nonparametric.KDEMultivariate to generate continuous probability distributions with kernel density estimation. The distribution is created ...
2
votes
1answer
21 views
Family distribution for continuous count data
I need to model the variable Total motile Count which describe how many million sperm cells in an entire ejaculate are motile. It is not a proper count since it is calculated as a product of other ...
0
votes
0answers
23 views
Choosing between independent t-test and multinomial regression
I have two variables: one dichotomous variable (0-1) and a continuous variables (utilities from a conjoint study). I now want to measure how the continuous influences the utilities. In other words: do ...
0
votes
0answers
16 views
Significance Testing of Means
How do I calculate the sample size required to design a test to assess lift in sales made by test and control groups? I know actual significance study will be based on actual means and standard ...
2
votes
0answers
39 views
Artificial Neural Network with continuous and binary variables
I have a dataset with numerical (continuous) and categorical variables. I want to fit an artificial neural network. To do so, I have transformed my categorical variables by using the 1-of-k method, so ...
4
votes
2answers
308 views
Common Continuous Distributions with [0,1] support
Question
I am looking to understand what possible common statistical continuous distributions exist with support [0,1].
Background
In my work I often come across data which are bounded between 0 ...
2
votes
0answers
24 views
Moving from discrete sum of changes to continuous integral of local covariance - how is this done?
I'm trying to derive a specific relationship about the relationship between forwards and futures. The expression is from the paper, "The relationship between forward and futures prices", written 1981 ...
3
votes
1answer
35 views
Understanding how to apply goodness of fit tests when parameters of a continuous non-normal distribution have been fitted to the data
I have some data and wish to fit several distributions to it, many of which are compound and/or complex. I'd like to know whether a given family of distributions is appropriate which I can apparently ...
0
votes
0answers
11 views
Goodness of fit for point processes instead of continuous processes
Why standard distance discrepancy measures applied in continuous data analyses, such as the average sum of squared deviations between recorded data values and estimated values from the model, cannot ...
1
vote
0answers
9 views
Clustered analysis with mclust and various type of variables [migrated]
After several hours spend on various blogs, I decided to explicitly ask my question on this website. I try to clustered my data with different methods. I already done the work with hierarchical ...
0
votes
1answer
18 views
Binary and continuous variable conversion for neural network analyses
I am a little confused about how to handle binary variables and continuous variables before being fed into a neural network in R. Please can you confirm that I should normalize all variables to fall ...
0
votes
0answers
27 views
Regression Options With Categorical X values and continuous data
I am visualizing average biological parameters (i.e., weights, lengths, age, condition, etc.) over time (specifically across years) using geom_boxplot in ggplot in R. I have also fit a trend line ...
0
votes
0answers
15 views
Discrete and Continuous Components of a Distribution Function
Can anyone provide me with any hint(s) as to how can I find the discrete and continuous components of the following distribution function, F(x) ?
$$F(x) = 0, \;\;\;\;x < 1 \\
= \frac{1}{2} +...
1
vote
0answers
15 views
How to examine the relationship between a multiple response categorical variable and a continuous outcome?
I have a questionnaire that asked people to endorse the top three types of internet based activity they engage in (binary yes/no for each category of activity, 22 categories). I want to examine ...
0
votes
1answer
54 views
Treating continuous variables
I attended a conference on ML and Data Science and I have a general question that was not answered in the conference.
If we have a continuous variable, let's say age. What is the best way to handle ...
1
vote
0answers
37 views
When does a continuous distribution become a discrete one?
If I have a continuous probability distribution, say normal, it's clearly theoretically continuous and values can be anywhere within it's range (+/- infinity I guess to infinite precision).
I then ...
0
votes
0answers
16 views
Mutual information for numerical data
My question is if is possible to compute the mutual information I(A;B) between two numerical attributes(A and B contains real data). I cant find solution for whole ...
-3
votes
1answer
44 views
linear regression with two categorial independent variables and a continuous dependent variable
I measured the amount of inflammation among patients as a categorical variable (mild/moderate/strong), I also measured the amount of disability of the patients using a questionnaire (score 0-23) 8 ...
0
votes
1answer
30 views
Can an ordinal variable such as Bond Rating reasonably be regressed against a continuous variable such as rate of return?
I encountered an analysis in which the analyst claims to have regressed ordinal values for bond ratings assign on a scale of 1 to 21 with 1 representing the highest rated bonds and 21 representing the ...
1
vote
0answers
17 views
Estimating Continuous latent variables in a general Bayesian network
I am working on a problem where every node in the Bayesian network is continuous random variable and the structure of the network is known. This network comprises of both observed and latent nodes, ...
1
vote
1answer
35 views
How to calculate likelihood for simple linear regression, why it is not zero?
I am confused with the likelihood for simple linear regression, in this note it says
$$
\large \prod_{i=1}^n p(y_i \mid x_i; \beta_0,\beta_1,\sigma^2) = \prod_{i=1}^n \frac 1 {\sqrt{2\pi\sigma^2}} e^{...
0
votes
1answer
57 views
Regression random forest and highly skewed response distribution
There is a great deal of information on how unbalanced data sets may impact predictive accuracy in classification problems. Several solutions have been proposed (see here). My questions are:
Can a ...
0
votes
0answers
27 views
Clustering mixed dependent variables
What is the suitable algorithm for clustering variables that are both continuous and categorical? I would use two-step clustering, but since it has dependent variable, it is not suitable.
2
votes
3answers
81 views
Integer Data: Categorical or Continuous?
I am wondering if integer predictor data should be treated as categorical (thus requiring encoding) or continuous. For example, if the range of a given predictor X ...
0
votes
1answer
79 views
Should I delete one year with small sample size from time series analysis?
I hope you can help me with this question:
I have a time series data (25 years) that I will analyze to find temporal changes on seasonality over time. I am using linear regression and my model ...
1
vote
0answers
26 views
Show probability that P(N > k) = 1/k for independent and identically distributed continuous random variables [closed]
Let $X_{1}, X_{2}, ...$ be independent and identically distributed continuous random variables.
Let $N$ be the smallest value of n for which $X_{n} > X_{1}$.
a) Show that $P(N > k) = \frac{1}{k}...
0
votes
1answer
55 views
Making discrete data “continuous” for time series analysis
I would like to make a time series model to forecast the number of reported car crashes in my local area. The data I have available is a log of when the accident was reported to the local dispatch ...
0
votes
0answers
15 views
What statistical test for an A/B split using revenue and impressions? (Continuous Vars.)
I want to conduct an A/B test between two different Ad Copies I have for a given advertisement on Google.
Ad Copy A might have 30 k impressions (number of times ad was displayed) with approximately $...
0
votes
1answer
23 views
Test between a Yes/No independent variable and a continuous Age variable?
Sorry if this is very easy but I have been struggling all day.
I have some data where I know the ages of 16 subjects:
3,3,4,4,4,5,5,5,5,5,5,5,6,6,7,9
and the corresponding answer:
Y,Y,Y,N,N,Y,Y,Y,...
1
vote
0answers
32 views
What statistical test? DV: Categorical IV: Continuous
This is for a university essay.
The data that I would like to compare are General Self Rated Attractiveness (GSA) and Sexual Orientation Index (SOI). There are 203 individuals with each with an SOI ...
0
votes
0answers
51 views
Categorical Model: Ordinal, Nominal and Continuous Independent and Ordinal 3 option dependent
I have a large data set ($n=1100$) where my dependent variable is ordinal with 3 options that I've recoded in the following order: excited; mixed feelings; wish I didn't.
My research question is as ...
0
votes
2answers
40 views
Expectation of 2 functions with one random variable
This may be a trivial question but I want to consult with you all.
Let U be a continuous random variable taking values int he interval [0,2pi]. Let X = cos(U), Y = sin(U). Determine the Pearson ...
0
votes
1answer
27 views
How to set up ANCOVA with two categorical variables?
I have a dataset of Geese body masses in two locations over a 100 day study period. I am interested in examining how mean body mass changes over study period and if there is a difference between the ...
0
votes
0answers
15 views
Sum of squared error and point process data
One standard distance measure applied in continuous data analysis is average sum of squared error. However, this method cannot be applied for point process data. And there is an alternative solution ...
0
votes
1answer
28 views
Calculating agreement between 3 users with continuous data
I have a dataset of consisting of volume measurements as taken by 3 users in two trials. That is to say, each user rated each point twice, therefore I have 6 measurements in total for each point. I ...
1
vote
1answer
33 views
Joint PDF of 2 continuous dependent RV
I am trying to calculate mutual information on two observed continuous variables X and Y, which I believe to be dependent.
The formula relies on p(x,y): the joint probability density function of X ...
0
votes
0answers
36 views
Continuous data discretization rules
I know it all depends on the data, but I am looking for a general, most common rules for continuous data discretization.
For example It could be a list like this:
Use Supervised MDL discretization (...
0
votes
0answers
11 views
Interpretation of and odds ratio with continuous IV [duplicate]
I am looking at the relationship between tenure and turnover. I did not do the analysis and do not have access to the original output or data. The only information I have is this: with each more year ...
2
votes
0answers
27 views
Continuous entropy comparison
I have a continuos time signal (speech signal) and I will add noise to it at different SNRs. I want to compare the entropy or the original signal (clean speech) with the noisy ones. The idea is to ...
0
votes
0answers
47 views
Proportional dependent variable [0,1]: continuous verus count distribution
I want to model a proportional variable bounded by [0,1] (the % of land fertilized). A high percentage of the data contains 0s, a smaller percentage contains 1s, and all the rest falls in between.
My ...
1
vote
3answers
78 views
Understanding symmetry of i.i.d continous random variable
I am reading Introduction to Probability by Joseph K. Blitzstein and Jessica Hwang, which states that Continuous r.v.s that are i.i.d have all possible ranking equally likely. In the proof, it is ...
0
votes
0answers
22 views
Interpretation: two variables interacting with the same moderator
In a data-set the variables are as follow:
X1: Independent Variable 1 (continuous)
X2: Independent Variable 2 (continuous)
M: Moderator (Categorical, 2 Levels: I and U)
Y: Dependent Variable
...
1
vote
1answer
42 views
How to choose analyzing method using discrete and continuous variables?
This is my first work, doing it for university. I have nominal, ordinal and numeric (or what they're called in English) variables. I think I finally figured out I have to name them discrete and ...
0
votes
0answers
19 views
Independent numeric variable with two levels
I am studying whether two independent variables (frequency $freq$, and time $t$) influence a (continuous) output variable ($HR$). The independent variables are numeric. However, the experiment tests ...
1
vote
0answers
20 views
Test difference between totals over periods of different length
I have totals for two periods of different lengths (previous month vs month-to-date) and want to test whether they are significantly different.
E.g.
1.-30. Nov 2016 = 102.3
1.-13. Dec 2016 = 41
If ...
1
vote
0answers
11 views
Which test to use for associations between 1 IV (attribute nominal) and 1 DV (continuous)?
The assumption is there is no relationship between organizational industry association (independent nominal) and technology adoption rates (months, dependent continuous). What test to use to test this ...
1
vote
0answers
40 views
Meta analysis of continuous outcome but different follow up time
I am looking at different studies and the outcome is a percent reduction from baseline. The studies all have different follow up times. If I simply control for follow up time, that may be biased since ...
0
votes
0answers
54 views
Cross Validation Split Data to test, train and validation datasets + Discretization
I need an advice which portion of a dataset should be used to calculate cuts for discretization. I use two levels of Cross Validation. One is external to the model creating, but the second is used ...
0
votes
0answers
32 views
How to find the value for which my continuous variable stops being significant?
I have a continuous distance measure as an explanatory variable (a property's distance to an amenity, ranging from 0 to 60 km). How, if possible at all, can I find the value for which distance to an ...
0
votes
1answer
16 views
Expressing a regression estimate when predictor is continuous and between 0 and 1
I am regressing a count variable (adverse consequences) against a continuous predictor variable between 0 and 1 where all values are beween 0 and 0.5 (blood alcohol.
I am using a hierarchical ...