To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 횙짯챨찾챨챕찼횧횘첵철째횙짯찼횚찼횦횙짯철째 111011011001101010011001111011001010011110101111111011001011000110101000111011001011000010111110111011001011000110101000111011001011000110010101111011001011000010111100111011011001101010100111111011011001101010011000111011001011001010110101111011001011001010100000111011001010011110111000111011011001101010011001111011001010011110101111111011001011000010111100111011011001101010011010111011001011000010111100111011011001101010100110111011011001101010011001111011001010011110101111111011001011001010100000111011001010011110111000 ed9a99eca7afecb1a8ecb0beecb1a8ecb195ecb0bced9aa7ed9a98ecb2b5ecb2a0eca7b8ed9a99eca7afecb0bced9a9aecb0bced9aa6ed9a99eca7afecb2a0eca7b8
UHC 횙짯챨찾챨챕찼횧횘첵철째횙짯찼횚찼횦횙짯철째 1100001110010011110000101010110111000011101100001100001110100011110000111011000011000011101010011100001110100001110000111001111011000011100100101100001110111101110000111011011011000010101100001100001110010011110000101010110111000011101000011100001110010100110000111010000111000011100111011100001110010011110000101010110111000011101101101100001010110000 c393c2adc3b0c3a3c3b0c3a9c3a1c39ec392c3bdc3b6c2b0c393c2adc3a1c394c3a1c39dc393c2adc3b6c2b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)