Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 擁??旬????????荏??齬??繹??B 100101110110100100111111001111111000111101111011001111110011111100111111001111110011111100111111001111110011111110001001011000000011111100111111111010101001011100111111001111111110001110001000001111110011111101000010 97693f3f8f7b3f3f3f3f3f3f3f3f89603f3fea973f3fe3883f3f42
EUC-JP 擁??旬????????荏??齬??繹??B 110011011100101000111111001111111011110111011100001111110011111100111111001111110011111100111111001111110011111110110001110000010011111100111111111100111111011100111111001111111110010111101000001111110011111101000010 cdca3f3fbddc3f3f3f3f3f3f3f3fb1c13f3ff3f73f3fe5e83f3f42
UTF-8 擁숉쓼旬욘깱溜뗦틦溜계텤荏쀣꽴齬잙젉繹먮젾B 11100110100100111000000111101100100010001000100111101100100100111011110011100110100101111010110011101100100110101001100011101010101110011011000111101111101001111000101111101011100101111010011011101101100010111010011011101111101001111000101111101010101100111000010011101101100001011010010011101000100011011000111111101100100000001010001111101010101111011011010011101001101111011010110011101100100111101001100111101100101000001000100111100111101110011011100111101011101010001010111011101100101000001011111001000010 e69381ec8889ec93bce697acec9a98eab9b1efa78beb97a6ed8ba6efa78beab384ed85a4e88d8fec80a3eabdb4e9bdacec9e99eca089e7b9b9eba8aeeca0be42
UHC 擁숉쓼旬욘깱溜뗦틦溜계텤荏쀣꽴齬잙젉繹먮젾B 11101000101101101001100111101101100111011001011111100010111000101011111111100110100000111001111111101010111111101000101111100110101110101001000011101010111111101011000011101000101101101001100111101100111110111001011111100011100001001011111111100101111000011001111111101011101000001000101111100110101110101001000011101011101000001011000001000010 e8b699ed9d97e2e2bfe6839feafe8be6ba90eafeb0e8b699ecfb97e384bfe5e19feba08be6ba90eba0b042

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
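
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. It assumes that Python's standard codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC rows, and that characters a charset cannot represent are replaced with `?` (byte 3f), as the table suggests. The names `CHARSETS`, `encode_bits` and `convert` are hypothetical helpers introduced only for this illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode
    # (an assumption about how the page handles them).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("B").items():
        print(label, bits, hexa)
```

For the ASCII letter "B", every charset listed here yields the single byte 42 (binary 01000010), which matches the final byte of each row in the table.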