To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 趙?烝?v趙?烝?vB 111001101110001000111111111000000111111000111111011101101110011011100010001111111110000001111110001111110111011001000010 e6e23fe07e3f76e6e23fe07e3f7642
EUC-JP 趙?烝?v趙?烝?vB 111011001110010000111111110111111101111100111111011101101110110011100100001111111101111111011111001111110111011001000010 ece43fdfdf3f76ece43fdfdf3f7642
UTF-8 趙렓烝렎v趙렓烝렎vB 111010001011011010011001111010111010000010010011111001111000001110011101111010111010000010001110011101101110100010110110100110011110101110100000100100111110011110000011100111011110101110100000100011100111011001000010 e8b699eba093e7839deba08e76e8b699eba093e7839deba08e7642
UHC 趙렓烝렎v趙렓烝렎vB 11110000111000011000111010101000111100011111011010001110101001000111011011110000111000011000111010101000111100011111011010001110101001000111011001000010 f0e18ea8f1f68ea476f0e18ea8f1f68ea47642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)