> what is the purpose of offering your anecdotal experience if you acknowledge that it is an n=1 datapoint?
What is the purpose of using large-n datapoints anymore when the primary motivation of "research" is becoming less about finding meaningful information and more about getting published (in for-profit, paywalled "journals") so you can get more grant money for your institution?
Haven't you seen the articles in the last year or so about how little scientific research is even reproducible anymore? You can just repeat your experiment until you get the results you want to see.
> I still think that science should be automated in some way, shape or form.
Meaning what?
> I'm imagining something like you have a dataset and you have to upload that dataset to some third party that checks it for its validity.
The results are what they are. What is 'validity' meant to mean?
> Of course, this is a completely silly idea but I'd love to know if someone has like any tangential related thoughts on this
Quantitative studies are already published with the proper analyses, which are invariably produced 'automatically' using software, not manual methods.
I imagine there might be some value in publishing raw data, though. There may sometimes be questions like privacy, but I don't imagine they'll always be show-stoppers.
>From my understanding, you can't draw any real conclusions from the data unless you predicted that it would look that way beforehand.
Oh my sweet summer child. A lot of lab work and data collection is expensive, and in the game of research you spend a lot of time gaming the system and meeting expectations rather than doing actual fundamental research. So much work tries to take the hard, low-return part, running the tests and collecting the data, and re-leverage it with increasingly complex statistical approaches.
I've worked in enough research environments to see that, more often than not, there's a selection toward research that can be pursued and falsified with existing data rather than the other way around. Here's this set of data and how it was collected; what arbitrary, novel new thing can we say about it? It may not be interesting, but it may be statistically or theoretically valid. The result is that you get a paper/publication out of it without doing the footwork.
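The mechanism behind "test until something comes out significant" is easy to demonstrate. Below is a minimal sketch (my own illustration, not anything from the thread): run standard t-tests on pure noise, where no real effect exists, and count how often a result still clears the conventional p < 0.05 bar by chance alone.

```python
# Simulate "significance mining": many hypothesis tests on pure noise.
import math
import random

def t_stat(sample):
    """One-sample t statistic against a true mean of 0."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    return mean / math.sqrt(var / n)

random.seed(0)
trials = 1000
hits = 0
for _ in range(trials):
    # 30 draws of pure noise: there is no real effect to find.
    sample = [random.gauss(0.0, 1.0) for _ in range(30)]
    # |t| > 2.045 corresponds to two-sided p < 0.05 at df = 29.
    if abs(t_stat(sample)) > 2.045:
        hits += 1

print(f"{hits} of {trials} noise-only 'experiments' look significant")
```

Roughly 5% of noise-only tests clear the bar, so if you probe one expensive dataset with twenty arbitrary hypotheses, you can expect a publishable-looking "finding" for free. That's why pre-registration (predicting the result beforehand, as the quote above puts it) matters.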
This is part of the reason researchers often hold their data tightly. You'd think scientists would want to share data, but it's a highly competitive environment, and if you took the risk of investing time and money in some costly data collection process, you want to do everything you can to say everything you can about it before someone else does so without any of the underlying cost. Sure, you may get a reference or footnote for your data, but that's not going to help much in the big scheme of things, not as much as a fresh publication. Also, if you're only being referenced for the data collection portion of your work... it doesn't say a lot about the work you did around that data collection.
> I disagree, the data exists, this is fundamental, we are not going towards data impoverishment, we are necessarily living in an environment rich with data. I rather have accurate data by far.
It may be an inexorable trend, but for the time being, individuals and organizations generally have a choice about the amount of data they collect.
> I took a look at the data. The data schema is disorganized to the point that a lot of janitorial work would be necessary to get it useable and perform any analysis or visualization.
Analytical data on which problems you're solving and how you're doing vs how you think you're doing? And who might be interested in purchasing that data...
> No, all the choices have already been made. The data have already been destroyed
You don't know that. You'd need an investigation to conclude that. Using it as an excuse not to investigate seems like assuming the conclusion.
Even if some data have been destroyed, it doesn't follow that every last piece of data everywhere has. Who knows what might turn out to be significant? There may well be relevant data in many countries, too, since the research was international.
> OTOH: you're presumably holding the non-aggregated data for aggregation purposes in the first place. IANAL but I think that needs consent.
I believe this is actually fine if they can show they don't hold this data longer than necessary and have a process for destroying it in a timely fashion.
> The fact is, most people aren't qualified to interpret the data.
So what if most people aren't qualified? Data is data, and it can be used, even by crackpots who want to use it. If those crackpots publish something wrong, I'm sure they'd be called out and either ignored or shunned by the publisher(s) anyway.
> Am I the only one who finds it concerning that they capture like everything and persisting it just for the sake of having it and finding a use for it later?
I am pretty sure this has been a standard practice for at least a decade now. Isn't that what the "big data" meme is about? Store everything, because you can always get more computational power and statistical techniques to extract value from it later on.
> However, researchers (not me) are already looking for ways to extract data from the libraries. I think it's only a matter of time before this becomes a much bigger problem.
What's the use of the data, and what problems do you see?
Observation in support of research, I'd guess. It no longer seems to be typical that you collect whatever data you need yourself.