To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ????∽?碎?ご 001111110011111100111111001111111000000111100100001111111110000111101010001111111000001010110010 3f3f3f3f81e43fe1ea3f82b2
EUC-JP ????∽?碎?ご 001111110011111100111111001111111010001011100110001111111110001011101100001111111010010010110100 3f3f3f3fa2e63fe2ec3fa4b4
UTF-8 閱묐떯璘∽쭏碎대ご 111010011001011010110001111010111010110010010000111010111001011010101111111011111010011110101111111000101000100010111101111011001010110110001111111001111010001010001110111010111000110010000000111000111000000110010100 e996b1ebac90eb96afefa7afe288bdecad8fe7a28eeb8c80e38194
UHC 閱묐떯璘∽쭏碎대ご 111001101111001110010001111010111000101110111111111011001101111010100001111011111010011110001000111000011110111110110100111010111010101010110100 e6f391eb8bbfecdea1efa788e1efb4ebaab4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)