Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雋?肢???翁?????雋?肢???翁?????^ 11101000101100100011111110001110100010000011111100111111001111111000100110100101001111110011111100111111001111110011111111101000101100100011111110001110100010000011111100111111001111111000100110100101001111110011111100111111001111110011111101011110 e8b23f8e883f3f3f89a53f3f3f3f3fe8b23f8e883f3f3f89a53f3f3f3f3f5e
EUC-JP 雋?肢???翁???檉?雋?肢???翁???檉?^ 1111000010110100001111111011101111101000001111110011111100111111101100101010011100111111001111110011111110001111110001011011101100111111111100001011010000111111101110111110100000111111001111110011111110110010101001110011111100111111001111111000111111000101101110110011111101011110 f0b43fbbe83f3f3fb2a73f3f3f8fc5bb3ff0b43fbbe83f3f3fb2a73f3f3f8fc5bb3f5e
UTF-8 雋렎肢골렰렪翁골렰렑檉룁雋렎肢골렰렪翁골렰렑檉뢸^ 11101001100110111000101111101011101000001000111011101000100000101010001011101010101100111010100011101011101000001011000011101011101000001010101011100111101111111000000111101010101100111010100011101011101000001011000011101011101000001001000111100110101010101000100111101011101000111000000111101001100110111000101111101011101000001000111011101000100000101010001011101010101100111010100011101011101000001011000011101011101000001010101011100111101111111000000111101010101100111010100011101011101000001011000011101011101000001001000111100110101010101000100111101011101000101011100001011110 e99b8beba08ee882a2eab3a8eba0b0eba0aae7bf81eab3a8eba0b0eba091e6aa89eba381e99b8beba08ee882a2eab3a8eba0b0eba0aae7bf81eab3a8eba0b0eba091e6aa89eba2b85e
UHC 雋렎肢골렰렪翁골렰렑檉룁雋렎肢골렰렪翁골렰렑檉뢸^ 11110001111001101000111010100100111100101011011010110000111100011000111010111101100011101011100011101000101110101011000011110001100011101011110110001110101001101110111111100000101101111101111011110001111001101000111010100100111100101011011010110000111100011000111010111101100011101011100011101000101110101011000011110001100011101011110110001110101001101110111111100000101101111101110001011110 f1e68ea4f2b6b0f18ebd8eb8e8bab0f18ebd8ea6efe0b7def1e68ea4f2b6b0f18ebd8eb8e8bab0f18ebd8ea6efe0b7dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
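
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, so some byte values may differ from the page's output. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Sketch: encode a string in several charsets and show its bits and hex.
# Assumption: these Python codec names correspond to the table's labels.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Return {label: (bits, hex)} for every charset in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("^").items():
        print(label, bits, hexstr)
```

For the ASCII character `^`, every charset listed here gives the same single byte, 01011110 (hex 5e), which matches the last byte of each row in the table.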