To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貂サ隴エ鬲エ阡。雎ク 111001101011100010111011111010001010110110110100111010011010110110110100111010001001010010100001111010001011000110111000 e6b8bbe8adb4e9adb4e894a1e8b1b8
EUC-JP 貂サ隴エ鬲エ阡。雎ク 1110110010111010100011101011101111110000101011111000111010110100111100101010111110001110101101001110111111110100100011101010000111110000101100111000111010111000 ecba8ebbf0af8eb4f2af8eb4eff48ea1f0b38eb8
UTF-8 貂サ隴エ鬲エ阡。雎ク 111010001011001010000010111011111011110110111011111010011001101010110100111011111011110110110100111010011010110010110010111011111011110110110100111010011001100010100001111011111011110110100001111010011001101110001110111011111011110110111000 e8b282efbdbbe99ab4efbdb4e9acb2efbdb4e998a1efbda1e99b8eefbdb8
UHC 貂?????阡?雎? 11110101101100000011111100111111001111110011111100111111111101001100011000111111111011101101000100111111 f5b03f3f3f3f3ff4c63feed13f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)