What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 螯コ閠ウ蟯ヲ遶コ蠏 | 1110010110100110101110101110100010000000101100111110010110110010101001101110011110101011101110101110010110110101 | e5a6bae880b3e5b2a6e7abbae5b5 |
| EUC-JP | 螯コ閠ウ蟯ヲ遶コ蠏 | 111010101010100010001110101110101110111111100000100011101011001111101010101101001000111010100110111011101010110110001110101110101110101010110111 | eaa88ebaefe08eb3eab48ea6eead8ebaeab7 |
| UTF-8 | 螯コ閠ウ蟯ヲ遶コ蠏 | 111010001001111010101111111011111011110110111010111010011001011010100000111011111011110110110011111010001001111110101111111011111011110110100110111010011000000110110110111011111011110110111010111010001010000010001111 | e89eafefbdbae996a0efbdb3e89fafefbda6e981b6efbdbae8a08f |
| UHC | ????蟯???? | 00111111001111110011111100111111111010011010100000111111001111110011111100111111 | 3f3f3f3fe9a83f3f3f3f |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
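
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the helper names `encode_row` and `encode_table` and the `CODECS` mapping are hypothetical, and the sketch assumes Python's built-in codecs `cp932` (for SJIS-WIN) and `cp949` (for UHC) behave like the converter's charsets, which may not hold for every character. Characters that a charset cannot represent are replaced with `?` (0x3f), matching the `?` entries above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.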