To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????ByB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010000100111100101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f427942
SJIS-WIN 姨??姨?㎞姨??姨??姨??姨??ByB 10011011010010000011111100111111100110110100100000111111100001110111000110011011010010000011111100111111100110110100100000111111001111111001101101001000001111110011111110011011010010000011111100111111010000100111100101000010 9b483f3f9b483f87719b483f3f9b483f3f9b483f3f9b483f3f427942
EUC-JP 姨??姨??姨??姨??姨??姨??ByB 110101011010100100111111001111111101010110101001001111110011111111010101101010010011111100111111110101011010100100111111001111111101010110101001001111110011111111010101101010010011111100111111010000100111100101000010 d5a93f3fd5a93f3fd5a93f3fd5a93f3fd5a93f3fd5a93f3f427942
UTF-8 姨뚰쉯姨뚯㎞姨뚰쉯姨뚯쮰姨뚰쉮姨뚯쮬ByB 111001011010011110101000111010111001101010110000111011001000100110101111111001011010011110101000111010111001101010101111111000111000111010011110111001011010011110101000111010111001101010110000111011001000100110101111111001011010011110101000111010111001101010101111111011001010111010110000111001011010011110101000111010111001101010110000111011001000100110101110111001011010011110101000111010111001101010101111111011001010111010101100010000100111100101000010 e5a7a8eb9ab0ec89afe5a7a8eb9aafe38e9ee5a7a8eb9ab0ec89afe5a7a8eb9aafecaeb0e5a7a8eb9ab0ec89aee5a7a8eb9aafecaeac427942
UHC 姨뚰쉯姨뚯㎞姨뚰쉯姨뚯쮰姨뚰쉮姨뚯쮬ByB 111011001010100110001100111011011001101010000111111011001010100110001100111011001010011110110000111011001010100110001100111011011001101010000111111011001010100110001100111011001010100010001101111011001010100110001100111011011001101010000110111011001010100110001100111011001010100010001001010000100111100101000010 eca98ced9a87eca98ceca7b0eca98ced9a87eca98ceca88deca98ced9a86eca98ceca889427942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)