To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN ???釗?R???釗?^[???釗?R???釗?^[^ 00111111001111110011111111111011101110110011111101010010001111110011111100111111111110111011101100111111010111100101101100111111001111110011111111111011101110110011111101010010001111110011111100111111111110111011101100111111010111100101101101011110 3f3f3ffbbb3f523f3f3ffbbb3f5e5b3f3f3ffbbb3f523f3f3ffbbb3f5e5b5e
EUC-JP ???釗?R???釗?^[???釗?R???釗?^[^ 0011111100111111001111111000111111100011101001100011111101010010001111110011111100111111100011111110001110100110001111110101111001011011001111110011111100111111100011111110001110100110001111110101001000111111001111110011111110001111111000111010011000111111010111100101101101011110 3f3f3f8fe3a63f523f3f3f8fe3a63f5e5b3f3f3f8fe3a63f523f3f3f8fe3a63f5e5b5e
UTF-8 列룸씈釗큡R列룸씈釗큡^[列룸씈釗큡R列룸씈釗큡^[^ 11101111101001101001110011101011101000111011100011101100100101001000100011101001100001111001011111101101100000011010000101010010111011111010011010011100111010111010001110111000111011001001010010001000111010011000011110010111111011011000000110100001010111100101101111101111101001101001110011101011101000111011100011101100100101001000100011101001100001111001011111101101100000011010000101010010111011111010011010011100111010111010001110111000111011001001010010001000111010011000011110010111111011011000000110100001010111100101101101011110 efa69ceba3b8ec9488e98797ed81a152efa69ceba3b8ec9488e98797ed81a15e5befa69ceba3b8ec9488e98797ed81a152efa69ceba3b8ec9488e98797ed81a15e5b5e
UHC 列룸씈釗큡R列룸씈釗큡^[列룸씈釗큡R列룸씈釗큡^[^ 1110011011101010101101111110101110011101101000001110000111110010101101000110111001010010111001101110101010110111111010111001110110100000111000011111001010110100011011100101111001011011111001101110101010110111111010111001110110100000111000011111001010110100011011100101001011100110111010101011011111101011100111011010000011100001111100101011010001101110010111100101101101011110 e6eab7eb9da0e1f2b46e52e6eab7eb9da0e1f2b46e5e5be6eab7eb9da0e1f2b46e52e6eab7eb9da0e1f2b46e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)