Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????±?±???± 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110110001001111111011000100111111001111110011111110110001 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13fb13f3f3fb1
SJIS-WIN ???雲徐????????上?±?±?刮?± 0011111100111111001111111000100101011111100011111001100100111111001111110011111100111111001111110011111100111111001111111000111111100011001111111000000101111101001111111000000101111101001111111001100110001001001111111000000101111101 3f3f3f895f8f993f3f3f3f3f3f3f3f8fe33f817d3f817d3f99893f817d
EUC-JP ???雲徐??庾?????上?±?±?刮?± 00111111001111110011111110110001110000001011110111111001001111110011111110001111101111001100111000111111001111110011111100111111001111111011111011100101001111111010000111011110001111111010000111011110001111111101000111101001001111111010000111011110 3f3f3fb1c0bdf93f3f8fbcce3f3f3f3f3fbee53fa1de3fa1de3fd1e93fa1de
UTF-8 쒀렲쒀雲徐렋뤊庾찊첸춲첁죳上춲±춲±쳩刮춲± 111011001001001010000000111010111010000010110010111011001001001010000000111010011001101110110010111001011011111010010000111010111010000010001011111010111010010010001010111001011011101010111110111011001011000010001010111011001011001010111000111011001011011010110010111011001011001010000001111011001010001110110011111001001011100010001010111011001011011010110010110000101011000111101100101101101011001011000010101100011110110010110011101010011110010110001000101011101110110010110110101100101100001010110001 ec9280eba0b2ec9280e99bb2e5be90eba08beba48ae5babeecb08aecb2b8ecb6b2ecb281eca3b3e4b88aecb6b2c2b1ecb6b2c2b1ecb3a9e588aeecb6b2c2b1
UHC 쒀렲쒀雲徐렋뤊庾찊첸춲첁죳上춲±춲±쳩刮춲± 1011111010101100100011101011111110111110101011001110101010100011110111111110111110001110101000101000111110111010111010101110110010101001100011101100001110111110101011011000111010101010100011101010000110001110110111111011111010101101100011101010000110111110101011011000111010100001101111101010101110001110110011101011111010101101100011101010000110111110 beac8ebfbeaceaa3dfef8ea28fbaeaeca98ec3bead8eaa8ea18edfbead8ea1bead8ea1beab8ecebead8ea1be

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
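
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the `?` entries in the table.

```python
# Table labels mapped to Python codec names.
# This mapping is an assumption made for illustration.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("±").items():
        print(label, bits, hex_string)
```

For example, `convert("±")["UTF-8"]` yields the hex string `c2b1`, the same bytes the UTF-8 row of the table shows for `±`.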