To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????m 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6d
SJIS-WIN ????????????????????????m 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6d
EUC-JP ????????????????????????m 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6d
UTF-8 셔샹셍렯렼렾렽석셍선셍롔렽섐셍섭셍롖렽서셍렎렼셉m 11101100100001011001010011101100100000111011100111101100100001011000110111101011101000001010111111101011101000001011110011101011101000001011111011101011101000001011110111101100100001001001110111101100100001011000110111101100100001001010000011101100100001011000110111101011101000011001010011101011101000001011110111101100100001001001000011101100100001011000110111101100100001001010110111101100100001011000110111101011101000011001011011101011101000001011110111101100100001001001110011101100100001011000110111101011101000001000111011101011101000001011110011101100100001011000100101101101 ec8594ec83b9ec858deba0afeba0bceba0beeba0bdec849dec858dec84a0ec858deba194eba0bdec8490ec858dec84adec858deba196eba0bdec849cec858deba08eeba0bcec85896d
UHC 셔샹셍렯렼렾렽석셍선셍롔렽섐셍섭셍롖렽서셍렎렼셉m 10111100110001011011110010100111101111001100010010001110101111001000111011000100100011101100011010001110110001011011110010101110101111001100010010111100101100011011110011000100100011101101100010001110110001011011110010101011101111001100010010111100101101111011110011000100100011101101101010001110110001011011110010101101101111001100010010001110101001001000111011000100101111001100000101101101 bcc5bca7bcc48ebc8ec48ec68ec5bcaebcc4bcb1bcc48ed88ec5bcabbcc4bcb7bcc48eda8ec5bcadbcc48ea48ec4bcc16d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)