To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 贓?莊豆鋸????? 1110011011011001001111111110010010110101100100111010010010001011100110000011111100111111001111110011111100111111 e6d93fe4b593a48b983f3f3f3f3f
EUC-JP 贓?莊豆鋸????? 1110110011011011001111111110100010110111110001101010011010110101111110000011111100111111001111110011111100111111 ecdb3fe8b7c6a6b5f83f3f3f3f3f
UTF-8 贓렑莊豆鋸泥계갹吏렣 111010001011010010010011111010111010000010010001111010001000111010001010111010001011000110000110111010011000101110111000111011111010011110100011111010101011001110000100111010101011000010111001111011111010011110011110111010111010000010100011 e8b493eba091e88e8ae8b186e98bb8efa7a3eab384eab0b9efa79eeba0a3
UHC 贓렑莊豆鋸泥계갹吏렣 1110110111111100100011101010011011101101111101101101010011100111110010111110101011101100101100101011000011101000101100001011110111101100101001111000111010110100 edfc8ea6edf6d4e7cbeaecb2b0e8b0bdeca78eb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)