To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN セ褥諶レ上貍シ 1011111011100101111100011111101110101010110110101000111111100011111001101011110010111100 bee5f1fbaada8fe3e6bcbc
EUC-JP セ褥諶レ上貍シ 100011101011111011101010111100111000111111011110101101011000111011011010101111101110010111101100101111101000111010111100 8ebeeaf38fdeb58edabee5ecbe8ebc
UTF-8 セ褥諶レ上貍シ 111011111011110110111110111010001010010010100101111010001010101110110110111011111011111010011010111001001011100010001010111010001011001010001101111011111011110110111100 efbdbee8a4a5e8abb6efbe9ae4b88ae8b28defbdbc
UHC ?褥諶?上?? 00111111111010011011001111100100101001100011111111011111101111100011111100111111 3fe9b3e4a63fdfbe3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)