To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???宜??鷹?き筌 0011111100111111001111111000101101011000001111110011111110010001111010010011111110000010101010111110001010100011 3f3f3f8b583f3f91e93f82abe2a3
EUC-JP ???宜??鷹?き筌 0011111100111111001111111011010110111001001111110011111111000010111010110011111110100100101011011110010010100101 3f3f3fb5b93f3fc2eb3fa4ade4a5
UTF-8 囹덈슢宜룡뵺鷹곷き筌 111011111010011010101001111010111000110110001000111011001000101010100010111001011010111010011100111010111010001110100001111010111011010110111010111010011011011110111001111010101011001110110111111000111000000110001101111001111010110110001100 efa6a9eb8d88ec8aa2e5ae9ceba3a1ebb5bae9b7b9eab3b7e3818de7ad8c
UHC 囹덈슢宜룡뵺鷹곷き筌 1110011110101010100010001110101110011010101011101110101111110001101101111110011010010100101110001110101111101101100000011110101110101010101011011110111110100111 e7aa88eb9aaeebf1b7e694b8ebed81ebaaadefa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)