To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????????????蔭??玉?????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100010001111110000111111001111111000101111001010001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f88fc3f3f8bca3f3f3f3f3f42
EUC-JP ?????????薏??蔭??玉?????B 0011111100111111001111110011111100111111001111110011111100111111001111111000111111011001110111100011111100111111101100001111111000111111001111111011011011001100001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f8fd9de3f3fb0fe3f3fb6cc3f3f3f3f3f42
UTF-8 溜브떱溜븐젿溜뷸뿼薏뺤쓻蔭띴슴玉숇젗溜븃렔B 11101111101001111000101111101011101110001000110011101011100101101011000111101111101001111000101111101011101110001001000011101100101000001011111111101111101001111000101111101011101101111011100011101011101111111011110011101000100101101000111111101011101110101010010011101100100100111011101111101000100101001010110111101011100111011011010011101100100010101011010011100111100011101000100111101100100010001000011111101100101000001001011111101111101001111000101111101011101110001000001111101011101000001001010001000010 efa78bebb88ceb96b1efa78bebb890eca0bfefa78bebb7b8ebbfbce8968febbaa4ec93bbe894adeb9db4ec8ab4e78e89ec8887eca097efa78bebb883eba09442
UHC 溜브떱溜븐젿溜뷸뿼薏뺤쓻蔭띴슴玉숇젗溜븃렔B 11101010111111101011101011101010101101101011011111101010111111101011101011101100101000001011000111101010111111101011101011100110100101111011110011101011111110111001010111101100100111011001011011101011111000111000110111100100101111011011111111101000101011001001100111101011101000001001001111101010111111101011101011101000100011101010100101000010 eafebaeab6b7eafebaeca0b1eafebae697bcebfb95ec9d96ebe38de4bdbfe8ac99eba093eafebae88ea942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (byte 3f) in the table above.
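
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the function name `encode_report` is hypothetical, and the mapping from the table's charset labels to Python codec names (in particular UHC to `cp949`) is an assumption. Unencodable characters are replaced with "?" so that they show up as 3f, as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_report(text):
    """Return {label: (binary string, hex string)} for each charset."""
    report = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        report[label] = (bits, raw.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_report("玉B").items():
        print(label, bits, hexstr)
```

For the input "玉B", the SJIS-WIN, EUC-JP, and UTF-8 hex strings should end in 8bca42, b6cc42, and e78e8942 respectively, which agrees with the bytes listed for 玉 and B in the table, while ISO-8859-1 yields 3f42.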