Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 航??肛??厭?? 100011010111000100111111001111111110001111101000001111110011111110001001011111010011111100111111 8d713f3fe3e83f3f897d3f3f
EUC-JP 航??肛??厭?? 101110011101001000111111001111111110011011101010001111110011111110110001110111100011111100111111 b9d23f3fe6ea3f3fb1de3f3f
UTF-8 航쒙쉿肛됵슬厭엉씉 111010001000100010101010111011001001001010011001111011001000100110111111111010001000001010011011111010111001000010110101111011001000101010101100111001011000111010101101111011001001011110001001111011001001010010001001 e888aaec9299ec89bfe8829beb90b5ec8aace58eadec9789ec9489
UHC 航쒙쉿肛됵슬厭엉씉 111110011111111010011100111011111011110110110010111110011111110110001001111011111011110110111101111001101111010010111110111110111001110110100001 f9fe9cefbdb2f9fd89efbdbde6f4befb9da1

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (byte 3f).
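
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the table's charset names to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_row` and `convert` are hypothetical helper names. Python substitutes one "?" for each character it cannot encode, so the number of 3f bytes may differ from the table above.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Decode again to see what actually survived the conversion.
    shown = data.decode(codec, errors="replace")
    # Each byte becomes 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Build one row per charset, keyed by the charset name used in the table."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexstr) in convert("航").items():
        print(name, shown, bits, hexstr)
```

For the single character 航, the UTF-8 row comes out as e888aa, which matches the first three bytes of the UTF-8 row in the table.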