To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 寤μ?飮??應??[寤μ?飮??應??[^ 1001101110001000100000111100101000111111100111110101101000111111001111111001110011100100001111110011111101011011100110111000100010000011110010100011111110011111010110100011111100111111100111001110010000111111001111110101101101011110 9b8883ca3f9f5a3f3f9ce43f3f5b9b8883ca3f9f5a3f3f9ce43f3f5b5e
EUC-JP 寤μ?飮??應??[寤μ?飮??應??[^ 1101010111101000101001101100110000111111110111011011101100111111001111111101100011100110001111110011111101011011110101011110100010100110110011000011111111011101101110110011111100111111110110001110011000111111001111110101101101011110 d5e8a6cc3fddbb3f3fd8e63f3f5bd5e8a6cc3fddbb3f3fd8e63f3f5b5e
UTF-8 寤μ뜴飮김뿥應뀁댇[寤μ뜴飮김뿥應뀁댇[^ 11100101101011111010010011001110101111001110101110011100101101001110100110100011101011101110101010111001100000001110101110111111101001011110011010000111100010011110101110000000100000011110101110001100100001110101101111100101101011111010010011001110101111001110101110011100101101001110100110100011101011101110101010111001100000001110101110111111101001011110011010000111100010011110101110000000100000011110101110001100100001110101101101011110 e5afa4cebceb9cb4e9a3aeeab980ebbfa5e68789eb8081eb8c875be5afa4cebceb9cb4e9a3aeeab980ebbfa5e68789eb8081eb8c875b5e
UHC 寤μ뜴飮김뿥應뀁댇[寤μ뜴飮김뿥應뀁댇[^ 111001111111010110100101111011001000110110110010111010111110011010110001111010001001011110100101111010111110101110110010111011001000100010110001010110111110011111110101101001011110110010001101101100101110101111100110101100011110100010010111101001011110101111101011101100101110110010001000101100010101101101011110 e7f5a5ec8db2ebe6b1e897a5ebebb2ec88b15be7f5a5ec8db2ebe6b1e897a5ebebb2ec88b15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)