What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰?????擬?????有→?乙??裔??異 1000100110000001001111110011111100111111001111110011111110001011010110110011111100111111001111110011111100111111100101110100110010000001101010000011111110001001101100110011111100111111111001011110000100111111001111111000100011011001 89813f3f3f3f3f8b5b3f3f3f3f3f974c81a83f89b33f3fe5e13f3f88d9
EUC-JP 堰?????擬??饔??有→?乙??裔??異 10110001111000010011111100111111001111110011111100111111101101011011110000111111001111111000111111101000111011110011111100111111110011011010110110100010101010100011111110110010101101010011111100111111111010101110001100111111001111111011000011011011 b1e13f3f3f3f3fb5bc3f3f8fe8ef3f3fcdada2aa3fb2b53f3feae33f3fb0db
UTF-8 堰묐쓷流쒒걡擬뺛걶饔낃퀣有→콢乙대즵裔됱뇴異 111001011010000010110000111010111010110010010000111011001001001110110111111011111010011110001010111011001001001010010010111010101011000110100001111001101001001110101100111010111011101010011011111010101011000110110110111010011010010110010100111010111000001010000011111011011000000010100011111001101001110010001001111000101000011010010010111011001011110110100010111001001011100110011001111010111000110010000000111011001010011010110101111010001010001110010100111010111001000010110001111010111000011110110100111001111001010110110000 e5a0b0ebac90ec93b7efa78aec9292eab1a1e693acebba9beab1b6e9a594eb8283ed80a3e69c89e28692ecbda2e4b999eb8c80eca6b5e8a394eb90b1eb87b4e795b0
UHC 堰묐쓷流쒒걡擬뺛걶饔낃퀣有→콢乙대즵裔됱뇴異 1110010111101000100100011110101110011101100101001110101011111100100111001110100110000001100010101110101111110100100101011110001110000001100111001110100010111101100001011110101010110011100101111110101011110011101000011110011010110001100110101110101111100000101101001110101110100011100001111110011111100000100010011110110010000111100110001110110010110110 e5e891eb9d94eafc9ce9818aebf495e3819ce8bd85eab397eaf3a1e6b19aebe0b4eba387e7e089ec8798ecb6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
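
The kind of conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function name encode_report is hypothetical, and the Python codec names "cp932" and "cp949" are assumed to be the closest counterparts of SJIS-Win and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how unencodable characters appear in the table.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    The codec names are Python's; mapping them to the labels used in the
    table (SJIS-Win -> cp932, UHC -> cp949) is an assumption.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, then concatenate.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("有"):
        print(name, bits, hexstr)
```

For the single character 有, the UTF-8 row is e69c89, the same byte sequence that appears for that character in the UTF-8 row of the table.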