To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 凋?賓才?畯?駿?? 100100101001110000111111100101010110111110001101110010110011111111111011011011110011111110001111011110000011111100111111 929c3f956f8dcb3ffb6f3f8f783f3f
EUC-JP 凋?賓才?畯?駿?? 11000011111111000011111111001001110100001011101011001101001111111000111111001101101110110011111110111101110110010011111100111111 c3fc3fc9d0bacd3f8fcdbb3fbdd93f3f
UTF-8 凋당賓才렱畯렜駿곌행 111001011000011110001011111010111000101110111001111010001011001110010011111001101000100110001101111010111010000010110001111001111001010110101111111010111010000010011100111010011010011110111111111010101011001110001100111011011001011010001001 e5878beb8bb9e8b393e6898deba0b1e795afeba09ce9a7bfeab38ced9689
UHC 凋당賓才렱畯렜駿곌행 1111000010111101101101001110011111011110101110011110111010100110100011101011111011110001111000011000111010101110111100011110011110110000111010101100011111100000 f0bdb4e7deb9eea68ebef1e18eaef1e7b0eac7e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)