To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??韋??應??域??宋??筌??循? 111000011001111100111111001111111110100011101000001111110011111110011100111001000011111100111111100010001110011000111111001111111001000101110110001111110011111111100010101000110011111100111111100011110111101000111111 e19f3f3fe8e83f3f9ce43f3f88e63f3f91763f3fe2a33f3f8f7a3f
EUC-JP 癲??韋??應??域??宋??筌??循? 111000101010000100111111001111111111000011101010001111110011111111011000111001100011111100111111101100001110100000111111001111111100000111010111001111110011111111100100101001010011111100111111101111011101101100111111 e2a13f3ff0ea3f3fd8e63f3fb0e83f3fc1d73f3fe4a53f3fbddb3f
UTF-8 癲욌맧韋껂뭄應뱀녇域뱀씓宋볦녃筌딄퍔循쯆 111001111001100110110010111011001001101010001100111010111010011110100111111010011001111110001011111010101011101110000010111010111010110110000100111001101000011110001001111010111011000110000000111010111000010110000111111001011001111110011111111010111011000110000000111011001001010010010011111001011010111010001011111010111011001110100110111010111000010110000011111001111010110110001100111010111001010010000100111011011000110110010100111001011011111010101010111011001010111110000110 e799b2ec9a8ceba7a7e99f8beabb82ebad84e68789ebb180eb8587e59f9febb180ec9493e5ae8bebb3a6eb8583e7ad8ceb9484ed8d94e5beaaecaf86
UHC 癲욌맧韋껂뭄應뱀녇域뱀씓宋볦녃筌딄퍔循쯆 11101111101001101001111011101011100100001011000011101010110111111000001111100100101110011011001111101011111010111011100111101100100001101011111011100110101101001011100111101100100111011010100111100001111001001001001111101100100001101011101111101111101001111000101011101010101110111000101111100010111000001010100101000010 efa69eeb90b0eadf83e4b9b3ebebb9ec86bee6b4b9ec9da9e1e493ec86bbefa78aeabb8be2e0a942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)