To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????\??????A 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c3f3f3f3f3f3f41
SJIS-WIN ??????????????\??????A 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c3f3f3f3f3f3f41
EUC-JP ??????????????\??????A 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c3f3f3f3f3f3f41
UTF-8 횄짚횂혵횂혖횄짚횂혖횂짜횄짢\횂혙횂혧횂혧A 1110110110011010100001001110110010100111100110101110110110011010100000101110110110011000101101011110110110011010100000101110110110011000100101101110110110011010100001001110110010100111100110101110110110011010100000101110110110011000100101101110110110011010100000101110110010100111100111001110110110011010100001001110110010100111101000100101110011101101100110101000001011101101100110001001100111101101100110101000001011101101100110001010011111101101100110101000001011101101100110001010011101000001 ed9a84eca79aed9a82ed98b5ed9a82ed9896ed9a84eca79aed9a82ed9896ed9a82eca79ced9a84eca7a25ced9a82ed9899ed9a82ed98a7ed9a82ed98a741
UHC 횄짚횂혵횂혖횄짚횂혖횂짜횄짢\횂혙횂혧횂혧A 110000111000001111000010101001001100001110000010110000101001110011000011100000101100001010000001110000111000001111000010101001001100001110000010110000101000000111000011100000101100001010100101110000111000001111000010101010000101110011000011100000101100001010000100110000111000001011000010100011111100001110000010110000101000111101000001 c383c2a4c382c29cc382c281c383c2a4c382c281c382c2a5c383c2a85cc382c284c382c28fc382c28f41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)