Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 îø´îººäá»×îø´îººäá»×B 111011101111100010110100111011101011101010111010111001001110000110111011110101111110111011111000101101001110111010111010101110101110010011100001101110111101011101000010 eef8b4eebabae4e1bbd7eef8b4eebabae4e1bbd742
SJIS-WIN ??´??????×??´??????×B 00111111001111111000000101001100001111110011111100111111001111110011111100111111100000010111111000111111001111111000000101001100001111110011111100111111001111110011111100111111100000010111111001000010 3f3f814c3f3f3f3f3f3f817e3f3f814c3f3f3f3f3f3f817e42
EUC-JP îø´îººäá?×îø´îººäá?×B 1000111110101011110000101000111110101001110011001010000110101101100011111010101111000010100011111010001011101011100011111010001011101011100011111010101110100011100011111010101110100001001111111010000111011111100011111010101111000010100011111010100111001100101000011010110110001111101010111100001010001111101000101110101110001111101000101110101110001111101010111010001110001111101010111010000100111111101000011101111101000010 8fabc28fa9cca1ad8fabc28fa2eb8fa2eb8faba38faba13fa1df8fabc28fa9cca1ad8fabc28fa2eb8fa2eb8faba38faba13fa1df42
UTF-8 îø´îººäá»×îø´îººäá»×B 1100001110101110110000111011100011000010101101001100001110101110110000101011101011000010101110101100001110100100110000111010000111000010101110111100001110010111110000111010111011000011101110001100001010110100110000111010111011000010101110101100001010111010110000111010010011000011101000011100001010111011110000111001011101000010 c3aec3b8c2b4c3aec2bac2bac3a4c3a1c2bbc397c3aec3b8c2b4c3aec2bac2bac3a4c3a1c2bbc39742
UHC ?ø´?ºº???×?ø´?ºº???×B 00111111101010011010101010100010101001010011111110101000101011001010100010101100001111110011111100111111101000011011111100111111101010011010101010100010101001010011111110101000101011001010100010101100001111110011111100111111101000011011111101000010 3fa9aaa2a53fa8aca8ac3f3f3fa1bf3fa9aaa2a53fa8aca8ac3f3f3fa1bf42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
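
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the tool's own implementation. The function name encode_report is hypothetical. The codec names are assumptions about which Python codecs correspond to the tool's labels: "cp932" for SJIS-WIN, "euc_jp" for EUC-JP, and "cp949" for UHC. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to match the 3f bytes in the SJIS-WIN and UHC rows.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        # Render each byte as 8 binary digits, most significant bit first
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    # U+00EE followed by "B", the first and last characters of the sample input
    for name, bits, hexstr in encode_report("\u00eeB"):
        print(name, bits, hexstr)
```

Because vendor mapping tables differ, the bytes produced for some characters may not match the table exactly, especially for the East Asian charsets.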