CS885 Fall 2022 - Reinforcement Learning

Overview

40% of final grade
To be done individually
To search for papers on a topic, use Google scholar. Most papers available on the web can be found via Google scholar. Furthermore, most links in Google scholar are free. As a student you should never have to pay to download a paper. If Google scholar does not provide a free link, you should be able to access the paper freely via the University's internet or the University's library website.

Options

Option A (Literature survey):
- Pick a problem that interests you
- Search the literature for reinforcement learning approaches to tackle this problem
- Survey and discuss the relative strengths of each approach
- If you'd like to see examples of survey papers in AI, have a look at the IJCAI-2021 survey track
Option B (Empirical evaluation):
- Pick a problem that interests you.
- Implement and experiment with several reinforcement learning techniques to tackle this problem.
Option C (Algorithm design):
- Identify a problem for which there are no satisfying approaches.
- Develop a new reinforcement learning technique to tackle this problem.
- Analyze theoretically and/or empirically the performance of your technique.
Option D (Dataset/Simulator/Benchmark design):
- Identify a problem for which there is a lack of datasets, simulators or benchmarks to evaluate reinforcement learning algorithm.
- Collect a dataset, design a new simulator or design a new benchmark to evaluate reinforcement learning algorithms.
- Demonstrate how some baseline reinforcement learning algorithms perform with your dataset, simulator or environment.
- If you'd like to see examples of papers describing datasets, simulators or benchmarks, have a look at the NeurIPS-2021 datasets and benchmarks track
Option E (Theoretical analysis):
- Identify a problem or reinforcement learning technique for which the properties (e.g., complexity, performance) are not well understood.
- Analyze the properties of this problem or technique.

Proposal (no mark)

Submit electronically via LEARN by October 26 (11:59 pm)
At most one page (excluding references)
Use the JMLR format: https://www.jmlr.org/format/format.html
Which option did you pick?
Option A (Literature survey):

What is the problem?
Cite 8 to 12 papers that you plan to survey.

Option B (Empirical evaluation):
- What is the problem?
- What reinforcement learning techniques do you plan to experiment with?
- Cite 4 to 8 related papers that you plan to review.
Option C (Algorithm design):
- What is the problem?
- Why are there no satisfying approaches?
- What is the intuition behind the new reinforcement learning technique that you plan to develop?
- Cite 4 to 8 related papers that you plan to review.
Option D (Dataset/Simulator/Benchmark design):
- What is the reinforcement learning problem for which there is a lack of datasets, simulators or benchmarks?
- What dataset, simulator or benchmark do you plan to design?
- Cite 4 to 8 related papers that you plan to review.
Option E (Theoretical analysis):
- What is the reinforcement learning problem or technique that you plan to analyze?
- What properties would you like to analyze/prove about this problem or technique?
- Cite 4 to 8 related papers that you plan to review.

Report (40% of final grade)

At most 8 pages (excluding references)
Use the JMLR format: https://www.jmlr.org/format/format.html
Explain the big picture and any necessary detail
Submit electronically via LEARN by December 12 (11:59 pm)

Suggested Structure for the Report

Option A (Literature survey):
- Introduction
  - What is the problem?
  - Why is it an important problem?
- Survey
  - Summarize the range of techniques by highlighting their strengths and weaknesses (i.e., the 8-12 papers that you read)
  - Tip: this summary should not be a laundry list of techniques with an independent paragraph for each technique
  - Suggestion: organize your summary based on desirable properties of the techniques
- Analysis:
  - What is the state of the art?
  - Any open problem?
- Conclusion
  - What have you learned?
  - What future research do you recommend?
Option B (Empirical evaluation):
- Introduction
  - What is the problem?
  - Why is it an important problem?
- Techniques to tackle the problem
  - Brief review of previous work concerning this problem (i.e., the 4-8 papers that you read)
  - Brief description of the techniques chosen and why
- Empirical evaluation
  - Compare empirically the techniques for complexity, performance, ease of use, etc.
- Conclusion:
  - What is the best technique?
  - Is any technique good enough to declare the problem solved?
  - What future research do you recommend?
Option C (Algorithm design):
- Introduction
  - What is the problem?
  - Why can't any of the existing techniques effectively tackle this problem?
  - What is the intuition behind the technique that you developed?
- Techniques to tackle the problem
  - Brief review of previous work concerning this problem (i.e., the 4-8 papers that you read)
  - Describe the technique that you developed
  - Brief description of the existing techniques that you will compare to
- Evaluation
  - Analyze and compare (empirically or theoretically) your new approach to existing approaches
- Conclusion:
  - Can your new technique effectively tackle the problem?
  - What future research do you recommend?
Option D (Dataset/Simulator/Benchmark):
- Introduction
  - What is the problem?
  - Why aren't existing datasets, simulators or benchmarks sufficient to evaluate RL techniques for this problem?
- Proposed Dataset, Simulator or Benchmark
  - Describe the proposed dataset, simulator or benchmnark
  - Describe the properties of the dataset, simulator or benchmark that are unique
- Evaluation
  - Brief description of some baseline RL techniques that you plan to evaluate
  - Compare empirically the baseline RL techniques with your dataset, simulator or benchmark
- Conclusion:
  - What are the most important weaknesses of existing baselines that your dataset, simulator or benchmark highlighted
  - What future research do you recommend?
Option E (Theoretical analysis):
- Introduction
  - What is the problem or technique?
  - What properties did you analyze/prove about this problem or technique?
- Analysis
  - Brief survey of previous work concerning this problem (i.e., the 4-8 papers that you read)
  - Describe the analysis performed
- Conclusion:
  - What have you discovered about the technique analyzed?
  - What future research do you recommend?