To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??泣??儒??疫????┠釉??筌 1001101001101010001111110011111110001011100000110011111100111111100011101111001000111111001111111000100101110101001111110011111100111111001111111000010010110101111001111101011000111111001111111110001010100011 9a6a3f3f8b833f3f8ef23f3f89753f3f3f3f84b5e7d63f3fe2a3
EUC-JP 嗚??泣??儒??疫??荑?┠釉??筌 11010011110010110011111100111111101101011110001100111111001111111011110011110100001111110011111110110001110101100011111100111111100011111101011111111001001111111010100010110111111011101101100000111111001111111110010010100101 d3cb3f3fb5e33f3fbcf43f3fb1d63f3f8fd7f93fa8b7eed83f3fe4a5
UTF-8 嗚삠굦泣쒏껸儒룸겱疫뀀챶荑낉┠釉먮폇筌 111001011001011110011010111011001000001010100000111010101011010110100110111001101011001110100011111011001001001010001111111010101011101110111000111001011000010010010010111010111010001110111000111010101011001010110001111001111001011010101011111010111000000010000000111011001011000110110110111010001000110110010001111010111000001010001001111000101001010010100000111010011000011110001001111010111010100010101110111011011000111110000111111001111010110110001100 e5979aec82a0eab5a6e6b3a3ec928feabbb8e58492eba3b8eab2b1e796abeb8080ecb1b6e88d91eb8289e294a0e98789eba8aeed8f87e7ad8c
UHC 嗚삠굦泣쒏껸儒룸겱疫뀀챶荑낉┠釉먮폇筌 1110011111110000101110111110001110000010100011001110101111101000100111001110011010110010101110011110101011100011101101111110101110000001101111011110011010111001101100101110101110101010100000111110110010111111100001011110111110100110101101111110101110111000100100001110101110111100100101001110111110100111 e7f0bbe3828cebe89ce6b2b9eae3b7eb81bde6b9b2ebaa83ecbf85efa6b7ebb890ebbc94efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)