What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?®?n}v?®?n}vB 00111111101011100011111101101110011111010111011000111111101011100011111101101110011111010111011001000010 3fae3f6e7d763fae3f6e7d7642
SJIS-WIN ???n}v???n}vB 00111111001111110011111101101110011111010111011000111111001111110011111101101110011111010111011001000010 3f3f3f6e7d763f3f3f6e7d7642
EUC-JP ?®?n}v?®?n}vB 0011111110001111101000101110111000111111011011100111110101110110001111111000111110100010111011100011111101101110011111010111011001000010 3f8fa2ee3f6e7d763f8fa2ee3f6e7d7642
UTF-8 鍊®삂n}v鍊®삂n}vB 1110111110100110100110111100001010101110111011001000001010000010011011100111110101110110111011111010011010011011110000101010111011101100100000101000001001101110011111010111011001000010 efa69bc2aeec82826e7d76efa69bc2aeec82826e7d7642
UHC 鍊®삂n}v鍊®삂n}vB 11100110111010001010001011100111100110001000100101101110011111010111011011100110111010001010001011100111100110001000100101101110011111010111011001000010 e6e8a2e798896e7d76e6e8a2e798896e7d7642

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
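
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the implementation behind this page. It assumes that Python's built-in codecs `cp932` and `cp949` stand in for SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), which is what the table rows suggest. The names `CODECS` and `encode_report` are hypothetical, chosen for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Windows Shift_JIS variant
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as Unified Hangul Code
}


def encode_report(text, codecs=CODECS):
    """Return one (label, rendered, binary, hexadecimal) tuple per charset."""
    rows = []
    for label, codec in codecs.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        rendered = raw.decode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, rendered, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    # The code points decoded from the UTF-8 row of the table, written as
    # escapes so the source file's own encoding does not matter.
    sample = "\uf99b\u00ae\uc082n}v" * 2 + "B"
    for label, rendered, binary, hexadecimal in encode_report(sample):
        print(label, rendered, binary, hexadecimal)
```

Individual codecs may differ from the page in edge cases, so the output for the Japanese and Korean charsets should be compared against the table rather than assumed identical.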