To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???v???vB 001111110011111100111111011101100011111100111111001111110111011001000010 3f3f3f763f3f3f7642
SJIS-WIN 繕??v繕??vB 1001000101010101001111110011111101110110100100010101010100111111001111110111011001000010 91553f3f7691553f3f7642
EUC-JP 繕?琰v繕?琰vB 110000011011011000111111100011111100110010110100011101101100000110110110001111111000111111001100101101000111011001000010 c1b63f8fccb476c1b63f8fccb47642
UTF-8 繕쏁琰v繕쏁琰vB 111001111011100110010101111011001000111110000001111001111001000010110000011101101110011110111001100101011110110010001111100000011110011110010000101100000111011001000010 e7b995ec8f81e790b076e7b995ec8f81e790b07642
UHC 繕쏁琰v繕쏁琰vB 111000001100101110011011111001111110011011111100011101101110000011001011100110111110011111100110111111000111011001000010 e0cb9be7e6fc76e0cb9be7e6fc7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)