To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ???姨??夷?????????ъ?姨?㎞h 001111110011111100111111100110110100100000111111001111111000100011001110001111110011111100111111001111110011111100111111001111110011111100111111100001001000110000111111100110110100100000111111100001110111000101101000 3f3f3f9b483f3f88ce3f3f3f3f3f3f3f3f3f848c3f9b483f877168
EUC-JP ???姨??夷?????????ъ?姨??h 0011111100111111001111111101010110101001001111110011111110110000110100000011111100111111001111110011111100111111001111110011111100111111001111111010011111101100001111111101010110101001001111110011111101101000 3f3f3fd5a93f3fb0d03f3f3f3f3f3f3f3f3fa7ec3fd5a93f3f68
UTF-8 梨붿쿃姨뚯콉夷랁삻淋볦쭠梨뚯콬吏ъ괜姨띿㎞h 111011111010011110100010111010111011011010111111111011001011111110000011111001011010011110101000111010111001101010101111111011001011110110001001111001011010010010110111111010111001111010000001111011001000001010111011111011111010011110110101111010111011001110100110111011001010110110100000111011111010011110100010111010111001101010101111111011001011110110101100111011111010011110011110110100011000101011101010101101001001110011100101101001111010100011101011100111011011111111100011100011101001111001101000 efa7a2ebb6bfecbf83e5a7a8eb9aafecbd89e5a4b7eb9e81ec82bbefa7b5ebb3a6ecada0efa7a2eb9aafecbdacefa79ed18aeab49ce5a7a8eb9dbfe38e9e68
UHC 梨붿쿃姨뚯콉夷랁삻淋볦쭠梨뚯콬吏ъ괜姨띿㎞h 11101100101100011001010011101100101100101001100111101100101010011000110011101100101100011000010111101100101010001000110111101101100110001011001011101100111110001001001111101100101001111001010111101100101100011000110011101100101100011010000011101100101001111010110011101100101100011010011011101100101010011000110111101100101001111011000001101000 ecb194ecb299eca98cecb185eca88ded98b2ecf893eca795ecb18cecb1a0eca7acecb1a6eca98deca7b068

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)