Dependence of time and temperature

onurk83 · February 26, 2022, 12:07pm

Hi Jayqi,

“A simple approach may ignore the time dimension and only consider the ion abundances as a function of temperature. However, there may be nuances to how the sample was heated over time that provides additional information”

I think we should add the time information. Time and temperature are highly correlated, but we cannot drop time: the ranges and slopes of the time-temperature curves differ between samples. When we discretize the overall temperature range into bins (of 100 degrees):

(-100, 0] can be the first bin for “S0000”.
(-100, 0] may not be the same bin for “S0001”; it may correspond to a different bin (e.g. (-100, -50]).

We need to add the time information. We should combine the time and temperature columns into a single column, and then discretize the overall temperature range into bins.
What do you think, Jayqi?
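
To make the binning concern concrete, here is a minimal sketch. The two temperature traces are made-up values, not competition data, and the helper names `fixed_bin_index` and `per_sample_edges` are hypothetical. It only illustrates how bins derived from each sample's own range give different edges for different samples, while a shared grid does not.

```python
import math

def fixed_bin_index(temp, start=-100.0, width=100.0):
    """Index of the half-open bin (lo, hi] on a grid shared by all samples."""
    return math.ceil((temp - start) / width) - 1

def per_sample_edges(temps, n_bins):
    """Equal-width bin edges computed from one sample's own min and max."""
    lo, hi = min(temps), max(temps)
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]

# Made-up temperature traces for two hypothetical samples
sample_a = [-100.0, -50.0, 0.0, 50.0, 100.0]
sample_b = [-100.0, -75.0, -50.0, -25.0, 0.0]

edges_a = per_sample_edges(sample_a, n_bins=2)  # first bin spans -100 to 0
edges_b = per_sample_edges(sample_b, n_bins=2)  # first bin spans -100 to -50

# On the shared grid, the same temperature always gets the same index
shared_index = fixed_bin_index(-60.0)
```

With per-sample edges, the "first bin" means a different temperature interval for each sample, which is the mismatch described above.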

Thanks.
Onur Koç

onurk83 · February 26, 2022, 1:02pm

I have attached an image.

onurk83 · February 26, 2022, 9:45pm

Hi Jayqi,
Let’s look at the differences after adding the effect of time for the testbed.

Which one is more logical?
Materials increase as time increases for SAM testbed S0754.

onurk83 · February 26, 2022, 9:47pm

The same technique for SAM testbed S0755.

onurk83 · February 27, 2022, 12:57pm

Hi Jayqi and Friends,

The mass spectrometry graph should also be informative. We also need to add the information from this graph.

See this link for more information about mass spectrometry: Mass spectrometry

When I drew this graph (abundance vs. m/z), I understood how the algorithm works and why we discretize the overall temperature range into bins. This graph represents the last moment of the accumulation process. The sample may consist of a different number of chemical components. For example, carbon dioxide and propane have the same m/z value (44), so both accumulate at m/z 44. But the accumulation at m/z 44 can come from different combinations of carbon dioxide and propane (00, 01, 10, 11), and 44 is not the only accumulation point for these two compounds.
So we can separate the compounds by using their other accumulation points. If you look at the link, you can easily understand. We need to find out which components the sample contains (by the way, we know the compounds for the training file S0755 because it has a training label).
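
As a rough illustration of this idea, the sketch below uses two placeholder compounds that share a peak at m/z 44 but differ in their other peaks. The peak sets in `TOY_PEAKS` are toy values chosen for the example, not reference spectra, and `unique_peaks` and `presence_code` are hypothetical helpers. It shows how the non-shared peaks could separate the four combinations (00, 01, 10, 11).

```python
# Toy peak sets: both compounds share m/z 44 (assumed values, not real spectra)
TOY_PEAKS = {
    "compound_a": {44, 28, 16},
    "compound_b": {44, 29, 27},
}

def unique_peaks(name, peaks=TOY_PEAKS):
    """Peaks of one compound that no other compound in the table has."""
    others = set().union(*(p for n, p in peaks.items() if n != name))
    return peaks[name] - others

def presence_code(observed, peaks=TOY_PEAKS):
    """One digit per compound: '1' if all of its unique peaks were observed."""
    return "".join(
        "1" if unique_peaks(name, peaks) <= observed else "0"
        for name in peaks
    )

only_shared = presence_code({44})                # shared peak alone decides nothing
only_a = presence_code({44, 28, 16})
only_b = presence_code({44, 29, 27})
both = presence_code({44, 28, 16, 29, 27})
```

In practice a model would learn such patterns from the labeled training files rather than from a hand-written table.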

By the way, we don’t need to learn chemistry or all the m/z values for every compound. By using these graphs, we can do this automatically.

Friends, if you have suggestions, I would be happy to read them.
If there are parts that you think are wrong, please reply.

onurk83 · February 28, 2022, 12:52pm

Hi Jayqi and friends,

I drew it in 3D so that it could be better understood (for S0002).

(for abun_minsub_scaled >= 0.001)

Onur Koç.

jayqi · February 28, 2022, 2:49pm

Hi @onurk83,

As a competition administrator, I cannot contribute to discussions about modeling strategy. My role is to answer questions about the rules of the competition and clarifications about the data or task.

However, you are welcome to continue sharing your thoughts here and other participants may be interested and join in. (Just don’t expect me to provide my opinions.)

I do want to note that if you’re going to add new posts regarding the same subject, please add them as replies to an existing topic rather than creating new topics entirely, so that we can keep the community forum more tidy. I’ve merged your last three other topics into this one.

For relevant domain knowledge on interpreting the data, I encourage you to read the “Understanding EGA-MS data” section on the problem description page.

Vatsal_Mars · March 16, 2022, 3:11am

Hi onurk83, I had a few questions on this. Can we have a chat about these two charts?

Vatsal_Mars · March 16, 2022, 3:16am

I need to understand the time-temperature combination bin here.

onurk83 · March 16, 2022, 7:44am

Hi Vatsal Mars,

Of course, we can discuss. I only used PCA for time and temp, and some graphs are reversed. But I think time_temp_bin is not correct. You can use linear regression for time and temp instead: find a and b, then use new_temp.
y = a x + b
x = time (input)
y = new_temp (output)

In this way, you use temperature values that depend on time.
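
A minimal sketch of this fit in plain Python, using ordinary least squares. The `time` and `temp` values are made up for illustration, and `fit_line` is a hypothetical helper, not part of any competition code.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a * x + b; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b

# Made-up measurements for one hypothetical sample
time = [0.0, 10.0, 20.0, 30.0]
temp = [30.0, 35.5, 39.5, 45.0]

a, b = fit_line(time, temp)

# new_temp is the fitted temperature at each time step
new_temp = [a * t + b for t in time]
```

Here `new_temp` smooths out the noise in the measured temperature while keeping its dependence on time.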
Onur Koc

onurk83 · March 16, 2022, 5:47pm

Hi,

I wanted to know all the slopes and intercepts of time vs. temp in the training data. I am sharing them with you for additional information.

Slopes and intercepts of time and temp in the training data.

If you want to ask questions, I am available.

Onur Koç.

Vatsal_Mars · March 16, 2022, 6:16pm

Can you please elaborate on this?

onurk83 · March 16, 2022, 7:16pm

We have 766 training samples. They have slightly different ranges and slopes of time and temp (last figure).

The first question that comes to mind:

How will we make time and temperature compatible across samples?

The abundance values that change with time and temperature need to be processed in the same range bins for every sample, for each m/z value.
Using linear regression, convert time and temp into new_temp. Then, I think, we scale new_temp to the same range for all samples, and then we can divide new_temp into bins. In this way, we ensure time-temperature compatibility across all samples.
By the way, because the SAM testbed data have a nonlinear time-temp function, you should use polynomial regression for them. For the commercial data, you can also use polynomial regression.
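
The scale-then-bin step could look roughly like the sketch below. It assumes min-max scaling to [0, 1] and equal-width bins, which is one possible reading of "the same range", and the `new_temp` values are made up. `min_max_scale` and `bin_index` are hypothetical helper names.

```python
def min_max_scale(values):
    """Rescale one sample's values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def bin_index(scaled, n_bins):
    """Equal-width bin index for a value in [0, 1]; 1.0 goes in the last bin."""
    return min(int(scaled * n_bins), n_bins - 1)

# Made-up fitted temperatures (new_temp) for two hypothetical samples
new_temp_a = [30.0, 265.0, 500.0, 970.0]
new_temp_b = [-60.0, 190.0, 440.0, 940.0]

bins_a = [bin_index(s, 4) for s in min_max_scale(new_temp_a)]
bins_b = [bin_index(s, 4) for s in min_max_scale(new_temp_b)]
```

Although the raw ranges differ, both samples end up on the same bin grid after scaling. The trade-off is that a bin no longer corresponds to a fixed absolute temperature.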

Vatsal_Mars · March 30, 2022, 7:55am

Wherever there is a linear function between time and temperature, do we even need to perform regression between them? Do you think that is necessary? I think we could do it only for the latter case, where the SAM testbeds have a nonlinear time-temp function.

onurk83 · March 31, 2022, 4:48pm

Hi Vatsal_Mars,

I think you are right. I would say that you should use only “temperature” and ignore time.

Onur Koç.
