To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ìö´wìö°ì§¸ìö´wìö°ì§¸^ 111011001111011010110100011101111110110011110110101100001110110010100111101110001110110011110110101101000111011111101100111101101011000011101100101001111011100001011110 ecf6b477ecf6b0eca7b8ecf6b477ecf6b0eca7b85e
SJIS-WIN ??´w??°?§???´w??°?§?^ 001111110011111110000001010011000111011100111111001111111000000110001011001111111000000110011000001111110011111100111111100000010100110001110111001111110011111110000001100010110011111110000001100110000011111101011110 3f3f814c773f3f818b3f81983f3f3f814c773f3f818b3f81983f5e
EUC-JP ìö´wìö°ì§¸ìö´wìö°ì§¸^ 100011111010101111000000100011111010101111010011101000011010110101110111100011111010101111000000100011111010101111010011101000011110101110001111101010111100000010100001111110001000111110100010101100011000111110101011110000001000111110101011110100111010000110101101011101111000111110101011110000001000111110101011110100111010000111101011100011111010101111000000101000011111100010001111101000101011000101011110 8fabc08fabd3a1ad778fabc08fabd3a1eb8fabc0a1f88fa2b18fabc08fabd3a1ad778fabc08fabd3a1eb8fabc0a1f88fa2b15e
UTF-8 ìö´wìö°ì§¸ìö´wìö°ì§¸^ 110000111010110011000011101101101100001010110100011101111100001110101100110000111011011011000010101100001100001110101100110000101010011111000010101110001100001110101100110000111011011011000010101101000111011111000011101011001100001110110110110000101011000011000011101011001100001010100111110000101011100001011110 c3acc3b6c2b477c3acc3b6c2b0c3acc2a7c2b8c3acc3b6c2b477c3acc3b6c2b0c3acc2a7c2b85e
UHC ??´w??°?§¸??´w??°?§¸^ 0011111100111111101000101010010101110111001111110011111110100001110001100011111110100001110101111010001010101100001111110011111110100010101001010111011100111111001111111010000111000110001111111010000111010111101000101010110001011110 3f3fa2a5773f3fa1c63fa1d7a2ac3f3fa2a5773f3fa1c63fa1d7a2ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)