Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厓??愉??油??馭??????o?庸??? 11111010100011010011111100111111100101101111100100111111001111111001011011111011001111110011111111101001011001100011111100111111001111110011111100111111001111111000001010001111001111111001011101100110001111110011111100111111 fa8d3f3f96f93f3f96fb3f3fe9663f3f3f3f3f3f828f3f97663f3f3f
EUC-JP 厓??愉??油??馭??堉??洹o?庸??? 100011111011010011000111001111110011111111001100111110110011111100111111110011001111110100111111001111111111000111000111001111110011111110001111101101111111110100111111001111111000111111000111101110101010001111101111001111111100110111000111001111110011111100111111 8fb4c73f3fccfb3f3fccfd3f3ff1c73f3f8fb7fd3f3f8fc7baa3ef3fcdc73f3f3f
UTF-8 厓쀢뼰愉꾤뙴油살탦馭귙뀼堉욆쯃洹o폋庸눸녠굻 111001011000111010010011111011001000000010100010111010111011110010110000111001101000010010001001111010101011111010100100111010111001100110110100111001101011001010111001111011001000001010110100111011011000001110100110111010011010011010101101111010101011011110011001111010111000000010111100111001011010000010001001111011001001101010000110111011001010111110000011111001101011010010111001111011111011110110001111111011011000111110001011111001011011101010111000111010111000100010111000111010111000010110100000111010101011010110111011 e58e93ec80a2ebbcb0e68489eabea4eb99b4e6b2b9ec82b4ed83a6e9a6adeab799eb80bce5a089ec9a86ecaf83e6b4b9efbd8fed8f8be5bab8eb88b8eb85a0eab5bb
UHC 厓쀢뼰愉꾤뙴油살탦馭귙뀼堉욆쯃洹o폋庸눸녠굻 1110010011101101100101111110001010010110101100111110101011110000100001001110011110001100101101111110101011111010101110111110110010110101100010001110010111011111100000101110001110000101101100101110101110111100100111101110100010101000100111111110101010110111101000111110111110111100100101101110100110111100100001111100111010110011111010101011000110111111 e4ed97e296b3eaf084e78cb7eafabbecb588e5df82e385b2ebbc9ee8a89feab7a3efbc96e9bc87ceb3eab1bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
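
A similar table can be produced offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's built-in codecs `cp932` and `cp949` are close enough stand-ins for SJIS-WIN and UHC, and that a character with no mapping in a charset should become "?" (0x3f), as in the table above. The names `CHARSETS` and `encode_table` are made up for illustration.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 approximates SJIS-WIN and cp949 approximates UHC;
# individual mappings may differ from the converter on this page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for characters the charset
        # cannot represent instead of raising UnicodeEncodeError.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.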