Skip to content

Commit

Permalink
Deploying to gh-pages from @ 7b06c85 🚀
Browse files Browse the repository at this point in the history
  • Loading branch information
gozdesahin committed May 20, 2024
1 parent fb051f7 commit 174d632
Show file tree
Hide file tree
Showing 32 changed files with 91 additions and 91 deletions.
2 changes: 1 addition & 1 deletion 404.html
Original file line number Diff line number Diff line change
Expand Up @@ -175,7 +175,7 @@ <h1 class="post-title">Page not found</h1>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion feed.xml
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en"><generator uri="https://jekyllrb.com/" version="4.3.3">Jekyll</generator><link href="https://gglab-ku.github.io/feed.xml" rel="self" type="application/atom+xml" /><link href="https://gglab-ku.github.io/" rel="alternate" type="text/html" hreflang="en" /><updated>2024-05-16T07:30:32+00:00</updated><id>https://gglab-ku.github.io/feed.xml</id><title type="html">blank</title><subtitle>Homepage for GGLab@Koc
<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en"><generator uri="https://jekyllrb.com/" version="4.3.3">Jekyll</generator><link href="https://gglab-ku.github.io/feed.xml" rel="self" type="application/atom+xml" /><link href="https://gglab-ku.github.io/" rel="alternate" type="text/html" hreflang="en" /><updated>2024-05-20T11:23:03+00:00</updated><id>https://gglab-ku.github.io/feed.xml</id><title type="html">blank</title><subtitle>Homepage for GGLab@Koc
</subtitle></feed>
36 changes: 18 additions & 18 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -288,6 +288,23 @@ <h2><a href="/news/" style="color: inherit;">news</a></h2>
<table class="table table-sm table-borderless">


<tr>
<th scope="row" style="width: 100px">May 2024</th>
<td>

Paper accepted to <a href="https://2024.aclweb.org/" rel="external nofollow noopener" target="_blank">Findings of ACL 2024</a>!

<hr>
Our paper entitled <a href="/assets/pdf/2403.03167.pdf">PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset</a> is accepted to <a href="https://2024.aclweb.org/" rel="external nofollow noopener" target="_blank">Findings of ACL 2024</a>! Check the <a href="https://github.com/GGLAB-KU/paradise" rel="external nofollow noopener" target="_blank">repo</a> for more details 📣

<blockquote>
Abstract: Recently, there has been growing interest within the community regarding whether large language models are capable of planning or executing plans. However, most prior studies use LLMs to generate high-level plans for simplified scenarios lacking linguistic complexity and domain diversity, limiting analysis of their planning abilities. These setups constrain evaluation methods (e.g., predefined action space), architectural choices (e.g., only generative models), and overlook the linguistic nuances essential for realistic analysis. To tackle this, we present PARADISE, an abductive reasoning task using Q\&amp;A format on practical procedural text sourced from wikiHow. It involves warning and tip inference tasks directly associated with goals, excluding intermediary steps, with the aim of testing the ability of the models to infer implicit knowledge of the plan solely from the given goal. Our experiments, utilizing fine-tuned language models and zero-shot prompting, reveal the effectiveness of task-specific small models over large language models in most scenarios. Despite advancements, all models fall short of human performance. Notably, our analysis uncovers intriguing insights, such as variations in model behavior with dropped keywords, struggles of BERT-family and GPT-4 with physical and abstract goals, and the proposed tasks offering valuable prior knowledge for other unseen procedural tasks. The PARADISE dataset and associated resources are publicly available for further research exploration with this https URL.
</blockquote>


</td>
</tr>

<tr>
<th scope="row" style="width: 100px">Apr 2024</th>
<td>
Expand Down Expand Up @@ -339,23 +356,6 @@ <h2><a href="/news/" style="color: inherit;">news</a></h2>
</td>
</tr>

<tr>
<th scope="row" style="width: 100px">Mar 2024</th>
<td>

Our new paper is available on <a href="https://arxiv.org/" rel="external nofollow noopener" target="_blank">arXiv</a>!

<hr>
Our paper entitled <a href="/assets/pdf/2403.03167.pdf">PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset</a> is available on <a href="https://arxiv.org/" rel="external nofollow noopener" target="_blank">arXiv</a>! Check the <a href="https://github.com/GGLAB-KU/paradise" rel="external nofollow noopener" target="_blank">repo</a> for more details 📣

<blockquote>
Abstract: Recently, there has been growing interest within the community regarding whether large language models are capable of planning or executing plans. However, most prior studies use LLMs to generate high-level plans for simplified scenarios lacking linguistic complexity and domain diversity, limiting analysis of their planning abilities. These setups constrain evaluation methods (e.g., predefined action space), architectural choices (e.g., only generative models), and overlook the linguistic nuances essential for realistic analysis. To tackle this, we present PARADISE, an abductive reasoning task using Q\&amp;A format on practical procedural text sourced from wikiHow. It involves warning and tip inference tasks directly associated with goals, excluding intermediary steps, with the aim of testing the ability of the models to infer implicit knowledge of the plan solely from the given goal. Our experiments, utilizing fine-tuned language models and zero-shot prompting, reveal the effectiveness of task-specific small models over large language models in most scenarios. Despite advancements, all models fall short of human performance. Notably, our analysis uncovers intriguing insights, such as variations in model behavior with dropped keywords, struggles of BERT-family and GPT-4 with physical and abstract goals, and the proposed tasks offering valuable prior knowledge for other unseen procedural tasks. The PARADISE dataset and associated resources are publicly available for further research exploration with this https URL.
</blockquote>


</td>
</tr>

<tr>
<th scope="row" style="width: 100px">Jan 2024</th>
<td>
Expand Down Expand Up @@ -438,7 +438,7 @@ <h2><a href="/news/" style="color: inherit;">news</a></h2>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_andrew.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_arda.html
Original file line number Diff line number Diff line change
Expand Up @@ -208,7 +208,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_atakan.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_farrin.html
Original file line number Diff line number Diff line change
Expand Up @@ -209,7 +209,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_muge.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_subha.html
Original file line number Diff line number Diff line change
Expand Up @@ -205,7 +205,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/alumni_tilek.html
Original file line number Diff line number Diff line change
Expand Up @@ -208,7 +208,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/faculty_gg.html
Original file line number Diff line number Diff line change
Expand Up @@ -208,7 +208,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/intern_durhasan.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/msc_ali.html
Original file line number Diff line number Diff line change
Expand Up @@ -205,7 +205,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/msc_hulki.html
Original file line number Diff line number Diff line change
Expand Up @@ -208,7 +208,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/msc_zeynel.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/phd_abed.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/phd_gurkan.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion members/phd_tamta.html
Original file line number Diff line number Diff line change
Expand Up @@ -204,7 +204,7 @@ <h1 class="post-title">
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion news/2023-07-12-paper-INLG23-accepted/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -191,7 +191,7 @@ <h1 class="post-title">Paper Inlg23 Accepted</h1>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion news/2023-09-06-papers-AACL23-accepted/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -198,7 +198,7 @@ <h1 class="post-title">Papers Aacl23 Accepted</h1>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion news/2024-01-18-paper-arxiv/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -191,7 +191,7 @@ <h1 class="post-title">Paper Arxiv</h1>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<title>Paper Arxiv | GGLab </title>
<title>Paper Acl Findings | GGLab </title>
<meta name="author" content="GGLab ">
<meta name="description" content="Homepage for GGLab@Koc
">
Expand Down Expand Up @@ -39,7 +39,7 @@
<link rel="shortcut icon" href="/assets/img/favicon144x144.png">

<link rel="stylesheet" href="/assets/css/main.css">
<link rel="canonical" href="https://gglab-ku.github.io/news/2024-03-06-paper-arxiv/">
<link rel="canonical" href="https://gglab-ku.github.io/news/2024-05-16-paper-ACL-findings/">

<!-- Dark Mode -->

Expand Down Expand Up @@ -158,8 +158,8 @@
<div class="post">

<header class="post-header">
<h1 class="post-title">Paper Arxiv</h1>
<p class="post-meta">March 6, 2024</p>
<h1 class="post-title">Paper Acl Findings</h1>
<p class="post-meta">May 16, 2024</p>
<p class="post-tags">
<a href="/blog/2024"> <i class="fas fa-calendar fa-sm"></i> 2024 </a>

Expand All @@ -169,10 +169,10 @@ <h1 class="post-title">Paper Arxiv</h1>
<article class="post-content">

<div id="markdown-content">
<p>Our new paper is available on <a href="https://arxiv.org/" rel="external nofollow noopener" target="_blank">arXiv</a>!</p>
<p>Paper accepted to <a href="https://2024.aclweb.org/" rel="external nofollow noopener" target="_blank">Findings of ACL 2024</a>!</p>

<hr>
<p>Our paper entitled <a href="/assets/pdf/2403.03167.pdf">PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset</a> is available on <a href="https://arxiv.org/" rel="external nofollow noopener" target="_blank">arXiv</a>! Check the <a href="https://github.com/GGLAB-KU/paradise" rel="external nofollow noopener" target="_blank">repo</a> for more details 📣</p>
<p>Our paper entitled <a href="/assets/pdf/2403.03167.pdf">PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset</a> is accepted to <a href="https://2024.aclweb.org/" rel="external nofollow noopener" target="_blank">Findings of ACL 2024</a>! Check the <a href="https://github.com/GGLAB-KU/paradise" rel="external nofollow noopener" target="_blank">repo</a> for more details 📣</p>

<blockquote>
<p>Abstract: Recently, there has been growing interest within the community regarding whether large language models are capable of planning or executing plans. However, most prior studies use LLMs to generate high-level plans for simplified scenarios lacking linguistic complexity and domain diversity, limiting analysis of their planning abilities. These setups constrain evaluation methods (e.g., predefined action space), architectural choices (e.g., only generative models), and overlook the linguistic nuances essential for realistic analysis. To tackle this, we present PARADISE, an abductive reasoning task using Q\&amp;A format on practical procedural text sourced from wikiHow. It involves warning and tip inference tasks directly associated with goals, excluding intermediary steps, with the aim of testing the ability of the models to infer implicit knowledge of the plan solely from the given goal. Our experiments, utilizing fine-tuned language models and zero-shot prompting, reveal the effectiveness of task-specific small models over large language models in most scenarios. Despite advancements, all models fall short of human performance. Notably, our analysis uncovers intriguing insights, such as variations in model behavior with dropped keywords, struggles of BERT-family and GPT-4 with physical and abstract goals, and the proposed tasks offering valuable prior knowledge for other unseen procedural tasks. The PARADISE dataset and associated resources are publicly available for further research exploration with this https URL.</p>
Expand All @@ -191,7 +191,7 @@ <h1 class="post-title">Paper Arxiv</h1>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
2 changes: 1 addition & 1 deletion news/TA-award-abed/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -191,7 +191,7 @@ <h1 class="post-title">Ta Award Abed</h1>
<footer class="fixed-bottom">
<div class="container mt-0">
© Copyright 2024 GGLab . Photos from <a href="https://www.freepik.com/" target="_blank" rel="external nofollow noopener">Freepik</a>.
Last updated: May 16, 2024.
Last updated: May 20, 2024.
</div>
</footer>

Expand Down
Loading

0 comments on commit 174d632

Please sign in to comment.