To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 혧채쩔쩌쨩혧횊책혦횠챰챤혦쨍쩌쨩혧횊채챙횜 111011011001100010100111111011001011000110000100111011001010100110010100111011001010100110001100111011001010100010101001111011011001100010100111111011011001101010001010111011001011000110000101111011011001100010100110111011011001101010100000111011001011000110110000111011001011000110100100111011011001100010100110111011001010100010001101111011001010100110001100111011001010100010101001111011011001100010100111111011011001101010001010111011001011000110000100111011001011000110011001111011011001101010011100 ed98a7ecb184eca994eca98ceca8a9ed98a7ed9a8aecb185ed98a6ed9aa0ecb1b0ecb1a4ed98a6eca88deca98ceca8a9ed98a7ed9a8aecb184ecb199ed9a9c
UHC 혧채쩔쩌쨩혧횊책혦횠챰챤혦쨍쩌쨩혧횊채챙횜 110000101000111111000011101001001100001010111111110000101011110011000010101110111100001010001111110000111000100011000011101001011100001010001110110000111001100011000011101100011100001110101110110000101000111011000010101110001100001010111100110000101011101111000010100011111100001110001000110000111010010011000011101011001100001110010110 c28fc3a4c2bfc2bcc2bbc28fc388c3a5c28ec398c3b1c3aec28ec2b8c2bcc2bbc28fc388c3a4c3acc396
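The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `bit_strings` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (byte 3f), which would account for the runs of 3f in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These names are assumptions; the page's own converter may behave differently.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("あ").items():
        print(label, binary, hexadecimal)
```

Calling `bit_strings` with an ASCII letter gives the same single byte in every row, while a Japanese character gives a different byte sequence per charset and a "?" in ISO-8859-1.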

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).