To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????????潔 001111110011111100111111001111110011111100111111001111110011111100111111001111111000110010001001 3f3f3f3f3f3f3f3f3f3f8c89
EUC-JP ??????????潔 001111110011111100111111001111110011111100111111001111110011111100111111001111111011011111101001 3f3f3f3f3f3f3f3f3f3fb7e9
UTF-8 쒀렲렋롅뤏쮱쨴죳칿쥙潔 111011001001001010000000111010111010000010110010111010111010000010001011111010111010000110000101111010111010010010001111111011001010111010110001111011001010100010110100111011001010001110110011111011001011100110111111111011001010010110011001111001101011110110010100 ec9280eba0b2eba08beba185eba48fecaeb1eca8b4eca3b3ecb9bfeca599e6bd94
UHC 쒀렲렋롅뤏쮱쨴죳칿쥙潔 10111110101011001000111010111111100011101010001010001110110010111000111110111111101010001000111010100100100011101010000110001110101011111000111010100010100011101100110010111110 beac8ebf8ea28ecb8fbfa88ea48ea18eaf8ea28eccbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)