To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????h? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f683f
SJIS-WIN ???姨??夷?????????ъ?姨?㎞h? 00111111001111110011111110011011010010000011111100111111100010001100111000111111001111110011111100111111001111110011111100111111001111110011111110000100100011000011111110011011010010000011111110000111011100010110100000111111 3f3f3f9b483f3f88ce3f3f3f3f3f3f3f3f3f848c3f9b483f8771683f
EUC-JP ???姨??夷?????????ъ?姨??h? 001111110011111100111111110101011010100100111111001111111011000011010000001111110011111100111111001111110011111100111111001111110011111100111111101001111110110000111111110101011010100100111111001111110110100000111111 3f3f3fd5a93f3fb0d03f3f3f3f3f3f3f3f3fa7ec3fd5a93f3f683f
UTF-8 梨뺥삸姨뚯콉夷랁삻淋볦쭠梨뷀삨吏ъ괜姨띿㎞h梨 111011111010011110100010111010111011101010100101111011001000001010111000111001011010011110101000111010111001101010101111111011001011110110001001111001011010010010110111111010111001111010000001111011001000001010111011111011111010011110110101111010111011001110100110111011001010110110100000111011111010011110100010111010111011011110000000111011001000001010101000111011111010011110011110110100011000101011101010101101001001110011100101101001111010100011101011100111011011111111100011100011101001111001101000111011111010011110100010 efa7a2ebbaa5ec82b8e5a7a8eb9aafecbd89e5a4b7eb9e81ec82bbefa7b5ebb3a6ecada0efa7a2ebb780ec82a8efa79ed18aeab49ce5a7a8eb9dbfe38e9e68efa7a2
UHC 梨뺥삸姨뚯콉夷랁삻淋볦쭠梨뷀삨吏ъ괜姨띿㎞h梨 111011001011000110010101111011011001100010101111111011001010100110001100111011001011000110000101111011001010100010001101111011011001100010110010111011001111100010010011111011001010011110010101111011001011000110010100111011011001100010100111111011001010011110101100111011001011000110100110111011001010100110001101111011001010011110110000011010001110110010110001 ecb195ed98afeca98cecb185eca88ded98b2ecf893eca795ecb194ed98a7eca7acecb1a6eca98deca7b068ecb1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)