aboutsummaryrefslogtreecommitdiff
path: root/src
Commit message (Collapse)AuthorAgeFilesLines
* [cup] Self-hosting is now possible! Make some tweaks to match C outputMustafa Quraish2022-02-072-3/+2
| | | | | | | | | | | A bit of a chonky commit, but this ports over the remaining (well, almost) everything from the C implementation to the self-hosted compiler. The only things that really remain right now are (1) defer support and (2) support for constants in local scopes. There were used barely enough so for now their uses have been removed, but I'll implement them back later. Not sure how useful (2) is though.
* [C]: Add `here` keyword that evaluates to a string with it's locationMustafa Quraish2022-02-061-0/+9
| | | | | | | | | It's not really a keyword like the other ones, but just handled completely at the lexer level since it already knows the location of the token, so it injects a string literal instead. We also use this in the self-hosted compiler now for better error reporting for where the error happened internally.
* Some very minor fixes (look at message)Mustafa Quraish2022-02-051-0/+1
| | | | | (1) Add support for escaped single quotes (2) Fix `putu_buffer` in `std/common.cup`
* Allow empty return statements for void functionsMustafa Quraish2022-02-052-4/+15
|
* Fix label counts during code generation for `||` and `&&`Mustafa Quraish2022-02-051-10/+10
|
* Add `lseek` and `mmap` syscall infoMustafa Quraish2022-02-051-1/+16
|
* Allow function declarations without a definitionMustafa Quraish2022-02-053-14/+41
|
* Handle command-line arguments properly on linuxMustafa Quraish2022-02-051-4/+13
| | | | | | | Turns out they're supposed to be accessed on the stack there. For macOS, because 16 byte-alignment is "required", we don't know the exact position of the arguments and that they're also passed in through `rdi` and `rsi`
* Add `OS_IS_MACOS` and `OS_IS_LINUX` constants as builtinsMustafa Quraish2022-02-051-0/+8
|
* Add support for some more binary ops: |, &, ^Mustafa Quraish2022-02-057-18/+58
|
* Add `sizeof(<type>)` operator that can return the size of a type.Mustafa Quraish2022-02-052-0/+9
|
* Add ability to initialize global variablesMustafa Quraish2022-02-053-4/+14
| | | | The code to initialize them is put right before calling `main()`
* Add `void` type and allow void* to be assigned to other ptr typesMustafa Quraish2022-02-054-7/+18
|
* Allow `builtins.c` to inject constants into program, use for syscallsMustafa Quraish2022-02-056-69/+95
| | | | | | We can now directly expose the `syscallN()` APIs to the program and define the `open()`, `write()` etc syscalls in the stdlib. This simplifies the implementation a decent bunch :^)
* Add an enum-like-but-not-really structureMustafa Quraish2022-02-042-0/+49
| | | | | | Basically, it looks like an enum but it isn't actually a type. It just behaves like a container to number all the (global) constants defined within it.
* Some minor bug fixesMustafa Quraish2022-02-042-4/+12
| | | | | | (1) Return correct size for unions (2) Make sure function name identifier doesn't exist (3) Assign `<anonymous>` as display name for nested compound types
* Minor fixes, rearranging, whitespace trimming. No functional changes.Mustafa Quraish2022-02-043-21/+19
|
* Don't open/parse a file that's been included more than onceMustafa Quraish2022-02-041-21/+39
| | | | | We don't use absolute paths or anything fancy right now, but it's a basic prototype which can be extended upon.
* Add support for read-only constant expressionsMustafa Quraish2022-02-043-6/+120
| | | | | | | We can now have constant definitions that are read-only, evaluated at compile time and just behave like integer literals when accessed. These are nice because we can now potentially inject syscall numbers/ etc through the compiler.
* Add `exit()` syscall builtinMustafa Quraish2022-02-031-0/+3
|
* Modify implementation of structs to support unionsMustafa Quraish2022-02-034-13/+19
| | | | | | This was simple enough, we just needed to change the part where we were computing the offset for each field, and the total size of the compound structure.
* Add support for basic structsMustafa Quraish2022-02-038-6/+192
| | | | | | | | Structs for now (and probably for the near future) are not allowed to be passed by value, and instead you just pass a pointer to it. Nested structs can also be defined, and they can be either anonymous, or named (in which case only the members can access the type).
* Add automatic type inference for initialized variable declarationsMustafa Quraish2022-02-031-17/+27
| | | | | | If we know the type of the expression on the RHS, and no type is specified, then we can just assume that the variable has the same type.
* Add pre-increment and pre-decrement operators.Mustafa Quraish2022-02-032-2/+34
| | | | | | The implementation is very hacky, maybe there's a better way to go about doing this. Maybe just keep it as single AST node and do the work during codegen?
* Add an implicit block around a `for` loop, allow statement initMustafa Quraish2022-02-031-12/+27
| | | | | Now we can have variable declarations as part of the initialization of the for loop, and it can only be accessed within the loop.
* Push `argc` and `argv` to the `main()` functionMustafa Quraish2022-02-031-0/+10
|
* Add helper to create builtins for syscalls + implement `read()`Mustafa Quraish2022-02-035-74/+125
| | | | | | | | | | This was possible, but very tedious to do by hand before. Now we automate it based on the number of arguments. Note that currently we can't just add `syscall3()` etc as builtins because the actual numbers for the system calls vary from one system to another, and we want to maintain support for macOS and Linux (at least for now).
* Check for integer-like types in type-checking instead of exact onesMustafa Quraish2022-02-031-7/+7
| | | | Since we allow implicit conversions now the exact type is less relevant.
* Remove `putc` intrinsic and replace with `write(fd, buf, size)`Mustafa Quraish2022-02-034-10/+30
| | | | | `putc`, `puts`, and `putnum` in `std/common.cup` are now implemented in terms of the `write()` syscall.
* Move builtins to a separate fileMustafa Quraish2022-02-033-28/+54
| | | | | Probably want to add more builtins in the future, so pulling it out of `parser.c` seems like the reasonable thing to do
* Allow implicitly converting between integer-like-typesMustafa Quraish2022-02-033-4/+32
| | | | This design should also be useful for structs down the road.
* Avoid pointer-arithmetic multiplications if we have `char*`Mustafa Quraish2022-02-031-24/+34
| | | | Simple optimization, it was just adding extra overhead anyway.
* Add support for `char` type + string/char literalsMustafa Quraish2022-02-028-21/+153
| | | | | | | | | | | This commit does a few things in one go: - Add support for a `char` type + some changes to support the new size - Add support for character literals. We need some escaping here to be able to use `\n` and `\0`, etc. - Add support for string literals. These are all stored in the `.data` section. Fortunately NASM already handles the escape characters. - Fix some bugs with code generation, specifically using `movsx` to sign extend the smaller types into 64-bit registers.
* Remove default initialization to 0 for variable declarationsMustafa Quraish2022-02-022-5/+3
| | | | | | This made sense when all variables were of the same size, but now with arrays we don't initialize them by default. Perhaps we should find a way to 0-initialize all the memory?
* Handle OP_ASSIGN in `print_ast()`Mustafa Quraish2022-02-021-0/+6
|
* Add initial support for arrays (also no testing)Mustafa Quraish2022-02-026-10/+95
| | | | | | Usual disclaimer at this point: Quick&Dirty implementation, hasn't been tested other than basic sanity checks. Arrays are automatically decayed into pointers when the identifier is accessed.
* Move type-related stuff to a separate fileMustafa Quraish2022-02-025-194/+217
| | | | | | It was getting a bit unwieldy in `parser.c`, and this will potentially help when we start dealing with more complex type-stuff such as casting / conversions / etc.
* Type checking of expressions / functions!Mustafa Quraish2022-02-026-14/+210
| | | | | | | | | | | | | | | | | This is a bit of a chonky commit, but it adds in the basics of checking the types of expressions / function calls / return types. There's still a lot of work to be done, including: (1) Adding new core types, and casting between allowed types automatically (2) Picking the corrent output type based on input types (for instance float+int == float) (3) We need much better error reporting, the error messages are really vague and unhelpful as-is (4) We also need to work to ensure that a function with a return type actually returns (5) Possible re-factoring to make stuff less hacky when we have more types / structs / arrays / etc.
* Use `type*` instead of `type&` to denote a pointer type (for now)Mustafa Quraish2022-02-021-1/+1
| | | | | | While I prefer the latter, the lexer/parser cannot correctly parse `type&&` since it thinks `&&` is `TOKEN_AND`. Perhaps how these cases are handled can be changed in the future.
* Add support for pointers! (tests missing)Mustafa Quraish2022-02-024-17/+30
| | | | | | | | | | This commit adds initial support for taking pointers / dereferencing. The type system is still a bit of a hot mess, so all type information is actually not looked at, but the functionality still seems to be there. Still need to add some tests for pointers/dereferencing to ensure that it works in some edge cases as well.
* Refactor variable access+assignment in terms of `generate_lvalue()`Mustafa Quraish2022-02-021-20/+21
| | | | This will make it trivial to compute pointers!
* Add `size_for_type()` helper instead of hard-coding variable sizesMustafa Quraish2022-02-023-5/+16
|
* Modify how types are stored.Mustafa Quraish2022-02-024-99/+143
| | | | | | | We now dynamically allocate the type structure, and recursively store a reference to the original type if it's a pointer. For now it's a little bit of a waste but it helps us future-proof stuff for more complex things to come
* Defer: Pop elements off the defer-stack when returningMustafa Quraish2022-02-011-2/+4
| | | | | | | | | | | | | We restore the count later, but this fix makes it so that the compiler doesn't get stuck in an infinite loop when you try to compile the following code: ``` fn main(): int { defer return 5; return 1; } ```
* Add basic `defer` implementation.Mustafa Quraish2022-02-014-0/+27
| | | | | | We don't have any closures yet, so it's essentially the same as just moving the statement after the `defer` keyword to the end of the block/ right before returning from the function.
* Global variables now supported! + some fixes to OP_ASSIGNMustafa Quraish2022-01-314-42/+93
| | | | | | Previously we weren't creating a new assignment node, and this was causing all sorts of funky errors. This commit also fixes that, and we can now use global variables :^)
* Add ability to import other filesMustafa Quraish2022-01-314-3/+62
| | | | | | This still requires a lot of work to avoid duplicate imports, and handle cyclic imports, but it is a good enough for small examples which just want to include some common definitions from std/
* Minor fixes to code generationMustafa Quraish2022-01-311-11/+11
| | | | | | | (1) Prefix function names with `func_` now, to avoid clasing with possible NASM keywords (such as `abs`). (2) Label numbers are now properly handed for conditional expressions.
* Fix offset for local variablesMustafa Quraish2022-01-311-1/+1
| | | | | | The stack actually grows downwards, so we need to account for that. The previous implementation was incorrect and exploded if you tried to use local variables in other functions except main.
* Make `Lexer_new` return a pointer instead of the object.Mustafa Quraish2022-01-313-8/+8
|