What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 寤μ????應??[寤μ????應??[^ 100110111000100010000011110010100011111100111111001111110011111110011100111001000011111100111111010110111001101110001000100000111100101000111111001111110011111100111111100111001110010000111111001111110101101101011110 9b8883ca3f3f3f3f9ce43f3f5b9b8883ca3f3f3f3f9ce43f3f5b5e
EUC-JP 寤μ?佾??應??[寤μ?佾??應??[^ 11010101111010001010011011001100001111111000111110110000111110110011111100111111110110001110011000111111001111110101101111010101111010001010011011001100001111111000111110110000111110110011111100111111110110001110011000111111001111110101101101011110 d5e8a6cc3f8fb0fb3f3fd8e63f3f5bd5e8a6cc3f8fb0fb3f3fd8e63f3f5b5e
UTF-8 寤μ뜲佾잒뿥應고뜜[寤μ뜲佾잒뿥應고뜜[^ 11100101101011111010010011001110101111001110101110011100101100101110010010111101101111101110110010011110100100101110101110111111101001011110011010000111100010011110101010110011101000001110101110011100100111000101101111100101101011111010010011001110101111001110101110011100101100101110010010111101101111101110110010011110100100101110101110111111101001011110011010000111100010011110101010110011101000001110101110011100100111000101101101011110 e5afa4cebceb9cb2e4bdbeec9e92ebbfa5e68789eab3a0eb9c9c5be5afa4cebceb9cb2e4bdbeec9e92ebbfa5e68789eab3a0eb9c9c5b5e
UHC 寤μ뜲佾잒뿥應고뜜[寤μ뜲佾잒뿥應고뜜[^ 111001111111010110100101111011001000110110110000111011001110101110011111111010001001011110100101111010111110101110110000111011011000110110011111010110111110011111110101101001011110110010001101101100001110110011101011100111111110100010010111101001011110101111101011101100001110110110001101100111110101101101011110 e7f5a5ec8db0eceb9fe897a5ebebb0ed8d9f5be7f5a5ec8db0eceb9fe897a5ebebb0ed8d9f5b5e

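SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (hexadecimal 3f) in that row.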
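
A comparable table can be produced offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Python's codecs may differ from this page's converter for a few vendor-specific characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset.

    Characters a charset cannot represent are replaced with "?" (0x3f).
    """
    rows = []
    for label, codec in charsets.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("寤μ"):
        print(label, bits, hexstr)
```

For the first two characters of the table's input, this sketch yields the same leading bytes as the rows above, for example `e5afa4cebc` for UTF-8 and `d5e8a6cc` for EUC-JP.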