What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨???怨棒??瀞??伊豆勿 100111111100010000111111001111110011111110001001100001011001011001011111001111110011111110010011110100100011111100111111100010001100100110010011101001001001011011011100 9fc43f3f3f8985965f3f3f93d23f3f88c993a496dc
EUC-JP 淨???怨棒??瀞??伊豆勿 110111101100011000111111001111110011111110110001111001011100101111000000001111110011111111000110110101000011111100111111101100001100101111000110101001101100110011011110 dec63f3f3fb1e5cbc03f3fc6d43f3fb0cbc6a6ccde
UTF-8 淨렠履렰怨棒렟렩瀞펠긺伊豆勿 111001101011011110101000111010111010000010100000111011111010011110011111111010111010000010110000111001101000000010101000111001101010001110010010111010111010000010011111111010111010000010101001111001111000000010011110111011011000111010100000111010101011100010111010111001001011110010001010111010001011000110000110111001011000101110111111 e6b7a8eba0a0efa79feba0b0e680a8e6a392eba09feba0a9e7809eed8ea0eab8bae4bc8ae8b186e58bbf
UHC 淨렠履렰怨棒렟렩瀞펠긺伊豆勿 11101111111001001000111010110001111011001010101010001110101111011110101010110011110111001110101010001110101100001000111010110111111011111110011111000110111001111011000111100111111011001010010111010100111001111101101010101000 efe48eb1ecaa8ebdeab3dcea8eb08eb7efe7c6e7b1e7eca5d4e7daa8

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
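
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions about which Python codecs correspond to the charsets listed above, and the function name `bit_strings` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("伊").items():
        print(label, binary, hexadecimal)
```

For example, `bit_strings("伊")["UTF-8"]` gives the hexadecimal string `e4bc8a`, the same three bytes that the UTF-8 row above shows for 伊.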