To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???znf???zn^}Y???znf???zn^}bE 0011111100111111001111110111101001101110011001100011111100111111001111110111101001101110010111100111110101011001001111110011111100111111011110100110111001100110001111110011111100111111011110100110111001011110011111010110001001000101 3f3f3f7a6e663f3f3f7a6e5e7d593f3f3f7a6e663f3f3f7a6e5e7d6245
SJIS-WIN 殲閃蕭znf殲閃蕭zn^}Y殲閃蕭znf殲閃蕭zn^}bE 1001111101110010100100010100110111100101010010100111101001101110011001101001111101110010100100010100110111100101010010100111101001101110010111100111110101011001100111110111001010010001010011011110010101001010011110100110111001100110100111110111001010010001010011011110010101001010011110100110111001011110011111010110001001000101 9f72914de54a7a6e669f72914de54a7a6e5e7d599f72914de54a7a6e669f72914de54a7a6e5e7d6245
EUC-JP 殲閃蕭znf殲閃蕭zn^}Y殲閃蕭znf殲閃蕭zn^}bE 1101110111010011110000011010111011101001101010110111101001101110011001101101110111010011110000011010111011101001101010110111101001101110010111100111110101011001110111011101001111000001101011101110100110101011011110100110111001100110110111011101001111000001101011101110100110101011011110100110111001011110011111010110001001000101 ddd3c1aee9ab7a6e66ddd3c1aee9ab7a6e5e7d59ddd3c1aee9ab7a6e66ddd3c1aee9ab7a6e5e7d6245
UTF-8 殲閃蕭znf殲閃蕭zn^}Y殲閃蕭znf殲閃蕭zn^}bE 1110011010101110101100101110100110010110100000111110100010010101101011010111101001101110011001101110011010101110101100101110100110010110100000111110100010010101101011010111101001101110010111100111110101011001111001101010111010110010111010011001011010000011111010001001010110101101011110100110111001100110111001101010111010110010111010011001011010000011111010001001010110101101011110100110111001011110011111010110001001000101 e6aeb2e99683e895ad7a6e66e6aeb2e99683e895ad7a6e5e7d59e6aeb2e99683e895ad7a6e66e6aeb2e99683e895ad7a6e5e7d6245
UHC 殲閃蕭znf殲閃蕭zn^}Y殲閃蕭znf殲閃蕭zn^}bE 1110000011101000111000001110110011100001110010110111101001101110011001101110000011101000111000001110110011100001110010110111101001101110010111100111110101011001111000001110100011100000111011001110000111001011011110100110111001100110111000001110100011100000111011001110000111001011011110100110111001011110011111010110001001000101 e0e8e0ece1cb7a6e66e0e8e0ece1cb7a6e5e7d59e0e8e0ece1cb7a6e66e0e8e0ece1cb7a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)