To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??揖??癒??寤λ?揖??癒⑤?偃 10011010011010100011111100111111100101110100101100111111001111111001011011111100001111110011111110011011100010001000001111001001001111111001011101001011001111110011111110010110111111001000011101000100001111111001100011101110 9a6a3f3f974b3f3f96fc3f3f9b8883c93f974b3f3f96fc87443f98ee
EUC-JP 嗚??揖??癒??寤λ?揖??癒??偃 110100111100101100111111001111111100110110101100001111110011111111001100111111100011111100111111110101011110100010100110110010110011111111001101101011000011111100111111110011001111111000111111001111111101000011110000 d3cb3f3fcdac3f3fccfe3f3fd5e8a6cb3fcdac3f3fccfe3f3fd0f0
UTF-8 嗚삳챿揖묋뿥癒뀁댇寤λ㉡揖묋뿥癒⑤툡偃 1110010110010111100110101110110010000010101100111110110010110001101111111110011010001111100101101110101110101100100010111110101110111111101001011110011110011001100100101110101110000000100000011110101110001100100001111110010110101111101001001100111010111011111000111000100110100001111001101000111110010110111010111010110010001011111010111011111110100101111001111001100110010010111000101001000110100100111011011000100010100001111001011000000110000011 e5979aec82b3ecb1bfe68f96ebac8bebbfa5e79992eb8081eb8c87e5afa4cebbe389a1e68f96ebac8bebbfa5e79992e291a4ed88a1e58183
UHC 嗚삳챿揖묋뿥癒뀁댇寤λ㉡揖묋뿥癒⑤툡偃 1110011111110000101110111110101110101010100011001110101111100111100100011110100010010111101001011110101110101000101100101110110010001000101100011110011111110101101001011110101110101000101100101110101111100111100100011110100010010111101001011110101110101000101010001110101110111000100110001110010111100111 e7f0bbebaa8cebe791e897a5eba8b2ec88b1e7f5a5eba8b2ebe791e897a5eba8a8ebb898e5e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)