FIT2102 Programming Paradigms

48 阅读15分钟

FIT2102 Programming Paradigms 2024
Assignment 2: Markdown to HTML
Due Date: Friday, 18th October, 11:55 pm
Weighting: 30% of your final mark for the unit
Interview: SWOTVAC + Week 13
Overview: Students will work independently to create a parser for a subset of the
Markdown specification using functional programming techniques. Programs will be
implemented in Haskell. The goal is to demonstrate a good understanding of
functional programming techniques as explored throughout the unit, including
written documentation of the design decisions and features.
Submission Instructions
Submit a zipped file named .zip which extracts to a folder
named

● It must contain all the code that will be marked including the report and all
code files
● You also need to include a report describing your design decisions. The report
must be named _.pdf.
● No additional Haskell libraries should be used
○ You may use additional libraries only for testing purposes.
● Before zipping, run stack clean --full (to ensure a small bundle)
● Do not submit node_modules or the .git folder
● Make sure the code you submit executes properly.
The marking process will look something like this:

  1. Extract _.zip
  2. Copy the submission folder contents into the assignment code bundle
    submission folder
  3. Execute stack build, stack test (for automated testing) and stack
    exec main/npm run dev for front end
    Please ensure that you test this process before submitting. Any issues during this
    process will make your marker unhappy and may result in a deduction in marks.
    Late submissions will be penalised at 5% per calendar day, rounded up. Late
    submissions more than seven days will receive zero marks and no feedback.Table of Contents
    Assignment 2: Markdown to HTML 1
    Submission Instructions 1
    Table of Contents 2
    Introduction 3
    Goals / Learning Outcomes 3
    Scope of assignment 3
    Exercises (24 marks) 4
    Part A: (12 marks): Parsing Markdown 5
    Aside - Text Modifiers (2 marks) 5
    Images (0.5 marks) 6
    Footnote References (0.5 marks) 6
    Free Text (1 mark) 6
    Headings (1 mark) 6
    Blockquotes (1 mark) 7
    Code (1 mark) 8
    Ordered Lists (2 marks) 8
    Tables (3 marks) 8
    Part B: (6 marks): HTML Conversion 9
    Text Modifiers (1 mark) 9
    Images (0.5 marks) 10
    Footnote References (0.5 marks) 10
    Free Text (0.5 marks) 10
    Headings (0.5 marks) 10
    Blockquotes (0.5 marks) 10
    Code (0.5 marks) 11
    Ordered Lists (1 marks) 11
    Tables (1 mark) 11
    Part C (6 marks): Adding extra functionality to the webpage 13
    Part D (up to 6 bonus marks): Extension 14
    Report (2 marks) 15
    Code Quality (4 marks) 16
    Marking breakdown 17
    Correctness 17
    Minimum Requirements: 18
    Changelog 18Introduction
    In this assignment, we will use Haskell to develop a transpiler that converts Markdown
    strings into HyperText Markup Language (HTML). This task involves parsing Markdown
    syntax and generating corresponding HTML output. A web page is provided, in which
    Markdown will be sent through an HTML-based websocket connection to a Haskell
    backend server, the Haskell server will need to convert this Markdown into the
    corresponding HTML and return it back to the website. A skeleton code was provided
    which will handle the basic communication between the web page and your assignment
    code.
    You are encouraged to utilise materials covered in previous weeks, including solutions
    for tutorial questions, to aid in the development of your transpiler. You must reference\ or cite ideas and 代 写FIT2102 Programming Paradigmscode constructs obtained from external sources, as well as
    anything else you might find in your independent research, for this assignment.
    The assignment is split up into Part A (parsing), Part B (pretty printing) and Part C
    (extras). However, we do recommend completing Part A/Part B in tandem.
    The language you will parse will be based on the Markdown specification, however with
    additional restrictions to reduce ambiguity. It is important that you read the requirements
    of each exercise carefully to avoid unnecessary work.
    Goals / Learning Outcomes
    The purpose of this assignment is to highlight and apply the skills you have learned to a
    practical exercise (parsing):
    ● Use functional programming and parsing effectively
    ● Understand and be able to use key functional programming principles (higher
    order functions, pure functions, immutable data structures, abstractions)
    ● Apply Haskell and FP techniques to parse non-trivial Markdown text
    Scope of assignment
    You are only required to parse an expression into the necessary data types and convert
    the result to an HTML string such that it can be rendered using an existing interpreter.
    You will not be required to render the Markdown or HTML strings.Exercises (24 marks)
    These exercises provide a structured approach for creating the beginnings of a
    transpiler.
    ● Part A (12 marks): Parsing Markdown strings
    ● Part B (6 marks): Conversion between Markdown and HTML
    ● Part C (6 marks): Adding extra functionality to the webpage.
    ● (Extension) Part D Part E: extensions for bonus marks!
    You must parse the input into an intermediary representation (ADT) such as an
    Abstract Syntax Tree to receive marks. This will allow easy conversion between
    your ADT and HTML.
    You must add deriving Show to your ADT and all custom types your ADT
    contains. (Note that the skeleton code already has deriving Show on the ADT type
    for you, which you must not remove.) You must not override this default Show
    instance as this will help us test your code.
    Your Assignment.hs file must export the following functions:
    ● markdownParser :: Parser ADT
    ● convertADTHTML :: ADT -> String
    Example Scripts
    For each of these exercises, there will be a series of provided Markdown files. By
    running stack test, it will try to parse the Markdown and save the output to a folder.
    This will generate HTML which you can manually view for correctness in a browser.
    During marking, we will be running your transpiler on more complex examples than the
    provided example scripts, therefore, it is important you devise your own test cases to
    ensure your parser is valid on more complex Markdown. It will also aim to produce a git
    diff, which is the difference between your output and the expected output. However, this
    requires installing the git command line tool. So, ensure that it is installed.
    Furthermore, the more recommended way to test your code will be to use npm run
    dev in combination with stack run main can be used to run the webpage with a live
    editor, running your code in real-time.Part A: (12 marks): Parsing Markdown
    The first part of this task, requires you to parse a markdown string into an Algebraic
    Data Type (ADT). This requires you to define your own Algebraic Data Type and define
    a series of functions that parse everything in the requirements. Consider that you will
    need to convert the result to HTML and therefore, your ADT should have enough
    information to assist you in converting to HTML.
    Aside - Text Modifiers (2 marks)
    There are six different modifiers for inline text, which can change the way a
    markdown string will be rendered. You do not have to worry about any escape
    characters. All text modifiers will need to be strictly non-empty.
    ● Italic Text: Specified by a single underscore character, _. For example,
    italics
    ● Bold Text: Specified by a set of two asterisks, **, around a word. For example,
    bold
    ● Strikethrough: Specified by two tilde characters, ~~, around a word. For example,
    strikethrough
    ● Link: Users can include a link to an external page using link text.
    For example, click here. You do not need to
    consider links inside links.
    ● Inline Code: Users can include code in the middle of sentences, using a backtick
    character, . For example, there is code` here
    ● Footnotes: Users can indicate a footnote with [^ℤ

], where ℤ

  • = {1,2,3,…}, i.e.,
    any positive integer. For example, 1, 2 and so forth. Note that you do
    not need to validate any sort of ordering on these numbers, e.g., the markdown
    may only contain one footnote [^10]. You also do not need to validate that the
    footnote comes with an appropriate reference (see Footnote References).
    ○ Note that there must not be any whitespace inside the [ and ]. For
    example, 1, 2, and [ ^3] are all not valid footnotes.
    You do not need to consider text with nested modifiers, such as bold and
    italics
    .
    Unless specified otherwise, the text inside the modifiers can include any amount of
    whitespace (excluding new lines). For example, _ italics _, **bold **, ~~
    strikethrough~~, inline code, and link text, and
    [link text] (example.com) are all valid.Images (0.5 marks)
    An image is specified with three parts:
  1. The Alt Text is the alternative text for the image, which is displayed if the
    image fails to load or for accessibility purposes.
  2. The URL is the URL or path to the image file. This can be a web URL or a local
    file path. The URL cannot contain any whitespace.
  3. The Caption Text is the caption for the image.
    Alt Text转存失败,建议直接上传图片文件
    The alternative text, caption text, and URL should not consider the text modifiers.
    An image must be at the beginning of a line, and the exclamation mark (!) character
    may be preceded by zero or more (non-newline) whitespace characters.
    There must be at least one non-newline whitespace character between the URL and
    the caption text. For example, ![Alt Text](URL"Caption Text") is not a valid
    image.
    There must not be any spaces after the ! and before the [.
    Footnote References (0.5 marks)
    Similarly to footnotes, footnote references consist of at the beginning of a line:
    ● zero or more (non-newline) whitespace characters, followed by
    ● [^ℤ
    +], where ℤ
  • = {1,2,3,…}, i.e., any positive integer, followed by
    ● a colon (:), followed by
    ● some text. Note that this text will not include the text modifiers. Leading
    whitespace before the text should be ignored.

Heading 1

Heading 2

Heading 3

Heading 4
Heading 5
Heading 6

This heading has a space before the hash

Note that because at least one non-newline whitespace character is required, this is not
a valid heading: #Heading 1. Also, because the line must start with the hash
characters (or whitespace), the following is not a valid heading: abc # Heading.
Alternatively, Heading 1 and Heading 2 can be specified with an alternative syntax
(shown below). On the line below the text, add at least 2 equals sign (=) characters for
heading level 1 or at least 2 dash (-) characters for heading level 2. The line below the
text must not contain any other characters. There is no alternative syntax for any other
heading levels. The heading text, equals sign (=) characters and dash (-) characers
may be preceded by zero or more (non-newline) whitespace characters.
Alternative Heading 1

Heading level 2

A heading 1 with a space in front of it====
Importantly, headings may include any of the previously mentioned text modifiers, for
example, a heading can be bolded, by surrounding it with a double asterisk.

Bolded Heading 1

Blockquotes (1 mark)
To create a block quote in Markdown, you use the greater than symbol (>) at the
beginning of a line followed by the text you want to quote. The greater than symbol (>)
may be preceded by zero or more (non-newline) whitespace characters before it on the
same line. You can also include multiple lines of text within the same block quote by
starting each consecutive line with the greater than symbol (>). Leading whitespace
after the greater than symbol (>) and before the text should be ignored. The text inside
the block quote may have text modifiers. You do not need to consider nested block
quotes. For example:

This is a block quote.
It can span multiple lines.
This has a space before > and is also a block quote
Code (1 mark)
A code block in Markdown starts with three backticks (```) on a line by themselves,
followed by an optional language identifier. The code block ends with another three
backticks on a line by themselves. The code block should not consider the text
modifiers. The first three backticks may have zero or more (non-newline) whitespace
characters preceding it. An example code block is:

main :: IO ()  
main = do  
putStrLn "Never gonna give you up"  
putStrLn "Never gonna let you down"  
putStrLn "Never gonna run around and desert you"  
Never gonna let you cryNever gonna say goodbye  
Never gonna tell a lie and hurt you  

Ordered Lists (2 marks)
An ordered list consists of at least one ordered list item separated by exactly 1 new line
character. An ordered list item starts with a positive number at the beginning of a line, a
. (full stop) character, and at least one whitespace character (excluding new lines). An
ordered list must start with the number 1, and any number after that can appear. You do
not have to consider any other numbering system or an unordered list.
Ordered lists may contain sublists, where there will be exactly 4 spaces before each
ordered list item. Each sublist must also start with the number 1. Similar to previous
sections, list items may also contain text modifiers.

  1. Item 1
  2. Sub Item 1
  3. Sub Item 2
  4. Sub Item 3
  5. Bolded Item 2
  6. Item 3
  7. Item 4
    You do not have to handle unordered lists.
    Ordered lists must not have any whitespace before the number, unless it is the 4
    spaces of indentation for a sublist. For example:
  8. This is an ordered list
  9. This is not an ordered list (starts with 2 spaces)
    Tables (3 marks)
    To create a table in Markdown, you use pipes (|) to separate columns and at least
    three dashes (-) between each column to separate the header row from the content
    rows. Each column may contain varying amounts of dashes. Each row is written on a
    separate line. The beginning and ending pipes (|) are compulsory. Each row must have
    the same amount of columns. Each cell may also contain text with the text modifiers.
    Leading and trailing whitespace before and after the text in each cell should be ignored.
    Each row in the table may be preceded by zero or more (non-newline) whitespace
    characters.| Tables | Are | Cool |
    | ------------- | ------------- | ----- |
    | here | is | data |
    | here | is | data |
    | here | is also | bolded data |
    | also | part of the | table |
    Part B: (6 marks): HTML Conversion
    The second part of this task requires you to convert your ADT into a HTML
    representation. The resulting HTML file must be formatted such that it is indented with 4
    spaces at the correct level to reflect the tree structure of HTML, ensuring that the HTML
    is valid and correctly renders the provided markdown. You do not need to indent the text
    modifiers, but other nested objects should be indented correctly.
    All HTML generated must be a self-contained webpage, i.e., including the following
    information, placing all generated HTML within the tags.
Test

GENERATED CONTENT GOES HERE

As a reference for the conversion between markdown and HTML, here will be listed the
conversion of all of the examples from above.
Text Modifiers (1 mark)
● Italics: italics
● Bold: bold
● Strikethrough: strikethrough
● Link: link text
● Inline Code: code● Footnotes: 1. It is
important that you follow this convention precisely, where 1 is the number
specified with the footnote, to ensure the footnotes work.
Images (0.5 marks)
The image must be in an image tag, with the appropriate attributes filled.
Alt Text转存失败,建议直接上传图片文件
Footnote References (0.5 marks)
A footnote reference must be encased in a

tag, and have the appropriately
numbered id.

My reference.

Another reference.

The 2 spaces after the colon should be ignored

space before the [

Free Text (0.5 marks) Every line of free text must be encased in

tags. You do not need to consider how to handle newlines.

Here is some markdown

More lines here

Text

Headings (0.5 marks) Where, the number after the h, contains the level of the heading, for example, in heading level 1:

Heading 1

Heading 2

Heading 3

Heading 4
Heading 5
Heading 6

This heading has a space before the hash

Alternative Heading 1

Heading level 2

A heading 1 with a space in front of it

Blockquotes (0.5 marks) Each blockquote must be encased by
, while each line within the blockquote must be encased with a

tag.

This is a block quote.

It can span multiple lines.

Code (0.5 marks) The code block must be encased in both the
 and the  and the  tags. If there is a  
language identifier (e.g., haskell), it must be included within the class attribute,  
prefixed by language-. Otherwise, there should not be any class attribute. The  
newlines and code indentation must remain.  

main :: IO ()  
main = do  
putStrLn "Never gonna give you up"  
putStrLn "Never gonna let you down"  
putStrLn "Never gonna run around and desert you"  

  

Never gonna let you cry  
Never gonna say goodbye  
Never gonna tell a lie and hurt you  

  
Ordered Lists (1 marks)  
Ordered lists must begin and end with the 
 tag, and each list item must begin and  
end with the opening/closing  tag.  

  
Item 1  

  
Sub Item 1  
Sub Item 2  
Sub Item 3
  
  
Bolded Item 2  
Item 3  
Item 4  

  
Tables (1 mark)  
The HTML convention for representing tables involves using the , ,  
 and  when rendering tables  
● 29 Sep: Fix indentation in ordered list HTML output  
  WX:codinghelp, and 	 elements.  represents the entire table,  represents a  
row within the table,  and  tags. Either of these outupts is  
acceptable:  
 represents a header cell within a table row, used for the  
header row, and 	 represents a data cell within a table row, used for the content  
rows.  
You may optionally include   
  
Tables	  
Are	  
Cool  

  
  
here	  
is	  
data  

  
  
here	  
is	  
data  

  
  
here	  
is also	  
bolded data  

  
  
also	  
part of the	  
table  
  
  
  
  
Tables	  
Are	  
Cool  
  
  
  
  
here	  
is	  
data  

  
  
here	  
is	  
data  

  
  
here	  
is also	  
bolded data  

  
  
also	  
part of the	  
table  
  
  
Part C (6 marks): Adding extra functionality to the webpage  
This task involves changing the webpage to include extra capabilities allowing a more  
feature-full UI. You will not be marked on the layout, or ease of use of features, as long  
as they are clearly visible to your marker, e.g., a button should be clearly visible on the  
screen. This task will involve some light additions to both the HTML page and  
TypeScript code. This will likely involve creating an observable stream for the data,  
merging it into the subscription stream, and sending the information to the Haskell  
backend. The communicated information between the Haskell backend and the  
webpage will need to be updated to include additional information that the user wants  
the engine to achieve.  
● A button must be added to the webpage for saving, where the converted HTML  
is saved using Haskell. The user does not need to be prompted for a file name,  
and the HTML should be saved according to the current time, formatted in ISO  
8601 format for the current date and time: YYYY-MM-DDTHH:MM:SS. The  
function getTime is provided which will provide you this time in an IO String  
format.  
● A separate input box, to allow the user to change the title of the page, instead of  
the default Converted HTML.Part D (up to 6 bonus marks): Extension  
Implement anything that is interesting, impressive, or otherwise “shows off” your  
understanding of Haskell, Functional Programming, and/or Parsing.  
To achieve the maximum amount of bonus marks, the feature should be similar in  
complexity to Part C (6 marks):  
The bonus marks only apply to this assignment, and the final mark for this assignment  
is capped at 30 marks (100%). This means you cannot score more than 30 marks or  
100%.  
Some suggestions for extensions of varying complexity and difficulty:  
● Markdown validation  
○ E.g., enforce all table columns have the same width  
● Correct BNF for the Markdown you are parsing in report (worth 2 marks)  
○ For any part of the parser which is not context-free, you may simplify the  
parsing rules to be context-free.  
● Further extensions to the webpage for extra features, using RxJS  
● Parse nested text modifiers, such as **_bold and italics_** and [click  
**here**](https://example.com)  
● Parse further parts of the markdown specification which make use of interesting  
parsers, which you have not used in other parts of the assignment.  
● Comprehensive test cases over the parser and pretty printing  
○ Warning: It is super hard to be comprehensive, stay away unless you love  
testing.  
(Choosing one of the simpler suggestions to implement may not receive the maximum  
available marks).Report (2 marks)  
You are required to provide a report in PDF format of max. 600 words (markers will not  
mark beyond this word limit). Descriptions of extensions can use up to 200 words per  
extension feature.  
Make sure to summarise the intention of the code, and highlight the interesting parts  
and difficulties you encountered. Focus on the "why" not the "how".  
Additionally, just posting screenshots of code is heavily discouraged, unless it  
contains something of particular importance. Remember, markers will be looking at your  
code alongside your report, so we do not need to see your code twice.  
Importantly, this report must include a description of why and how parser combinators  
helped you complete the parsing. In summary, your report should include the following  
sections:  
● Design of the code (including data structures)  
○ High-level description of approach  
○ High-level structure of code  
○ Code architecture choices  
● Parsing  
○ Usage of parser combinators  
○ Choices made in creating parsers and parser combinators  
○ How parsers and parser combinators were constructed using the Functor,  
Applicative, and Monad typeclasses  
● Functional Programming (focusing on the why)  
○ Small modular functions  
○ Composing small functions together  
○ Declarative style (including point free style)  
● Haskell Language Features Used (focusing on the why)  
○ Typeclasses and Custom Types  
○ Higher order functions, fmap, apply, bind  
○ Function composition  
● Description of Extensions (if applicable)  
○ What you intended to implement  
○ What you did implement  
○ What is cool/interesting/complex about it  
○ This may include using Haskell features that are not covered in course  
content  
There is some overlap between the sections. You should avoid repeating descriptions  
or ideas in the report.Code Quality (4 marks)  
Code quality will relate more to how understandable your code is. You must have  
readable and functional code, commented when necessary. Readable code means  
that you keep your lines at a reasonable length (< 80 characters), that you provide  
comments above non-trivial functions, and that you comment sections of your code  
whose function may not be clear.  
Your functions should all be small and modular, building up in complexity, and taking  
advantage of built-in functions or self-defined utility functions when possible. It should  
be easy to read and understand what each piece of your code is doing, and why it is  
useful. Do not reimplement library functions, such as map, and use the appropriate  
library function when possible.  
Your code should aim to re-use previous functions as much as possible, and not repeat  
work when possible.  
Code quality includes your ADT and if it is well structured, i.e., does not have a bunch of  
repeated data types and follows a logical manner (the JSON example from the applied  
session is a good example of what an ADT should look like).Marking breakdown  
The main marking criteria for each parsing and pretty printing exercise consists of two  
parts: correctness and FP style. Both correctness and FP style will be worth 50% of  
the marks for each of the exercises, i.e., if your code passes all tests, you will get at  
least half marks for Exercise A, and Exercise B.  
You will be provided with some sample input and tests for determining the validity of the  
outputted HTML files. The sample inputs provided will not be exhaustive, you are  
heavily encouraged to add your own, perhaps covering edge cases.  
Correctness  
We will be running a series of tests which test each exercise, and depending on how  
many of the tests you pass, a proportion of marks will be awarded  
FP Style  
FP style relates to if the code is done in a way that aligns with the unit content and  
functional programming.  
You must apply concepts from the course. The important thing here is that you need to  
use what we have taught you effectively. For example, defining a new type and its  
Monad instance, but then never actually needing to use it will not give you marks. Note:  
using bind (>>=) for the sake of using the Monad when it is not needed will not count  
as "effective usage."  
Most importantly, code that does not utilise Haskell's language features, and that  
attempts to code in a more imperative style, will not be awarded high marks.  
Minimum Requirements:  
An estimate of a passing grade will be parsing up to and including code blocks, but not  
lists or tables, where the difficulty and the marks step up. However, this will need to be  
accompanied by high code quality and a good report.  
A higher mark will require parsing of the more difficult data structures, and modifications  
of the HTML page.Changelog  
● Add note that text modifiers must be non-empty  
● Add note about BNF can simplify parser, if and only if the parser is not context  
free.  
● 18 Sep: Remove the requirement to parse nested text modifiers and instead  
make that an extension  
● 18 Sep: Fix issue in scaffold where frontend would show output HTML with a  
leading and trailing quote  
● 20 Sep: Changed “Abstract Data Type” to “Algebraic Data Type” (under Part A)  
● 24 Sep: Clarify that URLs in images should not consider text modifiers  
● 25 Sep: Clarify that there should be no spaces after ! and before [ in images  
● 25 Sep: Clarify whitespace rules for images, footnote references, headings,  
blockquotes, code blocks, and tables  
● 25 Sep: Specify how to convert a code block with no language identifier to HTML  
WX:codinghelp

Footnotes






My reference. ↩ ↩2






Another reference. ↩ ↩2