To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 澱詐??諸戡??趙???澱詐??諸戡??趙???^ 1001001101100010100011011011110000111111001111111000111110010100100111010100000100111111001111111110011011100010001111110011111100111111100100110110001010001101101111000011111100111111100011111001010010011101010000010011111100111111111001101110001000111111001111110011111101011110 93628dbc3f3f8f949d413f3fe6e23f3f3f93628dbc3f3f8f949d413f3fe6e23f3f3f5e
EUC-JP 澱詐??諸戡??趙???澱詐??諸戡??趙???^ 1100010111000011101110101011111000111111001111111011110111110100110110011010001000111111001111111110110011100100001111110011111100111111110001011100001110111010101111100011111100111111101111011111010011011001101000100011111100111111111011001110010000111111001111110011111101011110 c5c3babe3f3fbdf4d9a23f3fece43f3f3fc5c3babe3f3fbdf4d9a23f3fece43f3f3f5e
UTF-8 澱詐렰렚諸戡렰렏趙쿰렰렯澱詐렰렚諸戡렰렏趙쿰렰렯^ 11100110101111101011000111101000101010011001000011101011101000001011000011101011101000001001101011101000101010111011100011100110100010001010000111101011101000001011000011101011101000001000111111101000101101101001100111101100101111111011000011101011101000001011000011101011101000001010111111100110101111101011000111101000101010011001000011101011101000001011000011101011101000001001101011101000101010111011100011100110100010001010000111101011101000001011000011101011101000001000111111101000101101101001100111101100101111111011000011101011101000001011000011101011101000001010111101011110 e6beb1e8a990eba0b0eba09ae8abb8e688a1eba0b0eba08fe8b699ecbfb0eba0b0eba0afe6beb1e8a990eba0b0eba09ae8abb8e688a1eba0b0eba08fe8b699ecbfb0eba0b0eba0af5e
UHC 澱詐렰렚諸戡렰렏趙쿰렰렯澱詐렰렚諸戡렰렏趙쿰렰렯^ 11101110111111101101111011110001100011101011110110001110101011011111000010110011110010101111000110001110101111011000111010100101111100001110000111000100111100011000111010111101100011101011110011101110111111101101111011110001100011101011110110001110101011011111000010110011110010101111000110001110101111011000111010100101111100001110000111000100111100011000111010111101100011101011110001011110 eefedef18ebd8eadf0b3caf18ebd8ea5f0e1c4f18ebd8ebceefedef18ebd8eadf0b3caf18ebd8ea5f0e1c4f18ebd8ebc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)