What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻も?荏??諛?? 11100100111010001000001011100000001111111000100101100000001111110011111111100110100001110011111100111111 e4e882e03f89603f3fe6873f3f
EUC-JP 蒻も?荏??諛?? 11101000111010101010010011100010001111111011000111000001001111110011111111101011111001110011111100111111 e8eaa4e23fb1c13f3febe73f3f
UTF-8 蒻も븡荏멱쯁諛⒲돩 111010001001001010111011111000111000001010000010111010111011100010100001111010001000110110001111111010111010100110110001111011001010111110000001111010001010101110011011111000101001001010110010111010111000111110101001 e892bbe38282ebb8a1e88d8feba9b1ecaf81e8ab9be292b2eb8fa9
UHC 蒻も븡荏멱쯁諛⒲돩 111001011011011010101010111000101001010110001010111011001111101110111000111010001010100010011101111010111011000010101001111000111000100110101100 e5b6aae2958aecfbb8e8a89debb0a9e389ac

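SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is replaced by "?" (0x3f), which is why the ISO-8859-1 row consists only of 3f bytes.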
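
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the helper names (`encode_as_bits`, `convert`) are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Unencodable characters are replaced by "?" via `errors="replace"`, mirroring the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_as_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_as_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # The first two characters of the sample input in the table.
    for label, (binary, hexadecimal) in convert("蒻も").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.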