To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 墻??撓??厓?????絶??節??帳??B 10011010110101000011111100111111100111011001101000111111001111111111101010001101001111110011111100111111001111110011111110010000111000100011111100111111100100001101111100111111001111111001001010100000001111110011111101000010 9ad43f3f9d9a3f3ffa8d3f3f3f3f3f90e23f3f90df3f3f92a03f3f42
EUC-JP 墻??撓??厓?????絶??節??帳??B 1101010011010110001111110011111111011001111110100011111100111111100011111011010011000111001111110011111100111111001111110011111111000000111001000011111100111111110000001110000100111111001111111100010010100010001111110011111101000010 d4d63f3fd9fa3f3f8fb4c73f3f3f3f3fc0e43f3fc0e13f3fc4a23f3f42
UTF-8 墻⑼푶撓뷂쉔厓까뀤連어겑絶꾦뿏節ⓨ뤀帳쏁댚B 11100101101000101011101111100010100100011011110011101101100100011011011011100110100100101001001111101011101101111000001011101100100010011001010011100101100011101001001111101010101110011000110011101011100000001010010011101111101001101001101011101100100101101011010011101010101100101001000111100111101101011011011011101010101111101010011011101011101111111000111111100111101011111000000011100010100100111010100011101011101001001000000011100101101110001011001111101100100011111000000111101011100011001001101001000010 e5a2bbe291bced91b6e69293ebb782ec8994e58e93eab98ceb80a4efa69aec96b4eab291e7b5b6eabea6ebbf8fe7af80e293a8eba480e5b8b3ec8f81eb8c9a42
UHC 墻⑼푶撓뷂쉔厓까뀤連어겑絶꾦뿏節ⓨ뤀帳쏁댚B 11101101110111111010100111101111101111101000010011101000111101011001010011101111101111011010100011100100111011011011000111101110100001011001101111100110111001101011111011101110100000011010100111101111101111101000010011101001100101111001010011101111101111011010100011100101100011111011000111101101111000111001101111100111100010001011111001000010 eddfa9efbe84e8f594efbda8e4edb1ee859be6e6beee81a9efbe84e99794efbda8e58fb1ede39be788be42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)