To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??移?悠??怨烽∧??移?悠??怨烽∧^ 00111111001111111000100011011010001111111001011101001001001111110011111110001001100001011110000010000010100000011100100000111111001111111000100011011010001111111001011101001001001111110011111110001001100001011110000010000010100000011100100001011110 3f3f88da3f97493f3f8985e08281c83f3f88da3f97493f3f8985e08281c85e
EUC-JP 佾?移?悠??怨烽∧佾?移?悠??怨烽∧^ 1000111110110000111110110011111110110000110111000011111111001101101010100011111100111111101100011110010111011111111000101010001011001010100011111011000011111011001111111011000011011100001111111100110110101010001111110011111110110001111001011101111111100010101000101100101001011110 8fb0fb3fb0dc3fcdaa3f3fb1e5dfe2a2ca8fb0fb3fb0dc3fcdaa3f3fb1e5dfe2a2ca5e
UTF-8 佾렪移렊悠꿴떵怨烽∧佾렪移렊悠꿴떵怨烽∧^ 11100100101111011011111011101011101000001010101011100111101001111011101111101011101000001000101011100110100000101010000011101010101111111011010011101011100101101011010111100110100000001010100011100111100000111011110111100010100010001010011111100100101111011011111011101011101000001010101011100111101001111011101111101011101000001000101011100110100000101010000011101010101111111011010011101011100101101011010111100110100000001010100011100111100000111011110111100010100010001010011101011110 e4bdbeeba0aae7a7bbeba08ae682a0eabfb4eb96b5e680a8e783bde288a7e4bdbeeba0aae7a7bbeba08ae682a0eabfb4eb96b5e680a8e783bde288a75e
UHC 佾렪移렊悠꿴떵怨烽∧佾렪移렊悠꿴떵怨烽∧^ 1110110011101011100011101011100011101100101110011000111010100001111010101110110110110010111010011011011010111010111010101011001111011100111010111010000111111100111011001110101110001110101110001110110010111001100011101010000111101010111011011011001011101001101101101011101011101010101100111101110011101011101000011111110001011110 eceb8eb8ecb98ea1eaedb2e9b6baeab3dceba1fceceb8eb8ecb98ea1eaedb2e9b6baeab3dceba1fc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)