Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 額?????蟻?? 1000101001111010001111110011111100111111001111110011111110001011011000010011111100111111 8a7a3f3f3f3f3f8b613f3f
EUC-JP 額?????蟻?? 1011001111011011001111110011111100111111001111110011111110110101110000100011111100111111 b3db3f3f3f3f3fb5c23f3f
UTF-8 額딆빖梨뤄쭓蟻숈젷 111010011010000110001101111010111001010010000110111010111011100110010110111011111010011110100010111010111010010010000100111011001010110110010011111010001001111110111011111011001000100010001000111011001010000010110111 e9a18deb9486ebb996efa7a2eba484ecad93e89fbbec8888eca0b7
UHC 額딆빖梨뤄쭓蟻숈젷 111001001111111010001010111011001001010110111000111011001011000110110111111011111010011110001011111010111111110010011001111011001010000010101011 e4fe8aec95b8ecb1b7efa78bebfc99eca0ab

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
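
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, as are the helper names encode_row and convert. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the page does as well.

```python
# Assumed mapping from the charset labels used on the page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("額").items():
        print(label, bits, hexstr)
```

For the single character 額, this yields e9a18d for UTF-8, 8a7a for SJIS-WIN and b3db for EUC-JP, which matches the leading bytes of the corresponding rows in the table; ISO-8859-1 cannot represent it and yields 3f.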