What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 倭??????g?[倭??????g?[^ 10011000011000000011111100111111001111110011111100111111001111111000001010000111001111110101101110011000011000000011111100111111001111110011111100111111001111111000001010000111001111110101101101011110 98603f3f3f3f3f3f82873f5b98603f3f3f3f3f3f82873f5b5e
EUC-JP 倭??????g?[倭??????g?[^ 11001111110000010011111100111111001111110011111100111111001111111010001111100111001111110101101111001111110000010011111100111111001111110011111100111111001111111010001111100111001111110101101101011110 cfc13f3f3f3f3f3fa3e73f5bcfc13f3f3f3f3f3fa3e73f5b5e
UTF-8 倭랃퐦捻덄괘亮g솃[倭랃퐦捻덄괘亮g솃[^ 111001011000000010101101111010111001111010000011111011011001000010100110111011111010011010100100111010111000110110000100111010101011010010011000111011111010010110110111111011111011110110000111111011001000011010000011010110111110010110000000101011011110101110011110100000111110110110010000101001101110111110100110101001001110101110001101100001001110101010110100100110001110111110100101101101111110111110111101100001111110110010000110100000110101101101011110 e580adeb9e83ed90a6efa6a4eb8d84eab498efa5b7efbd87ec86835be580adeb9e83ed90a6efa6a4eb8d84eab498efa5b7efbd87ec86835b5e
UHC 倭랃퐦捻덄괘亮g솃[倭랃퐦捻덄괘亮g솃[^ 111010001101111010001101111011111011110110001111111001101111011110001000111001111011000110100101111001011011100110100011111001111001100110001000010110111110100011011110100011011110111110111101100011111110011011110111100010001110011110110001101001011110010110111001101000111110011110011001100010000101101101011110 e8de8defbd8fe6f788e7b1a5e5b9a3e799885be8de8defbd8fe6f788e7b1a5e5b9a3e799885b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
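
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names (to_bit_strings, charset_table) are hypothetical, and the mapping from the page's charset labels to Python codec names (SJIS-WIN as cp932, UHC as cp949) is an assumption that may differ from the page's converter for a few characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def charset_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hexadecimal string.
    for label, (binary, hexadecimal) in charset_table("倭^").items():
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("倭", "utf-8") yields the hexadecimal string e580ad, which matches the first three bytes of the UTF-8 row above, and "^" encodes to 5e in every charset listed.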