What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 艱城宛丼艱城宛丼[艱城宛丼艱城宛丼[^ 1110010010000101100011111110100110001000101101101001100010100101111001001000010110001111111010011000100010110110100110001010010101011011111001001000010110001111111010011000100010110110100110001010010111100100100001011000111111101001100010001011011010011000101001010101101101011110 e4858fe988b698a5e4858fe988b698a55be4858fe988b698a5e4858fe988b698a55b5e
EUC-JP 艱城宛丼艱城宛丼[艱城宛丼艱城宛丼[^ 1110011111100101101111101110101110110000101110001101000010100111111001111110010110111110111010111011000010111000110100001010011101011011111001111110010110111110111010111011000010111000110100001010011111100111111001011011111011101011101100001011100011010000101001110101101101011110 e7e5beebb0b8d0a7e7e5beebb0b8d0a75be7e5beebb0b8d0a7e7e5beebb0b8d0a75b5e
UTF-8 艱城宛丼艱城宛丼[艱城宛丼艱城宛丼[^ 111010001000100110110001111001011001111110001110111001011010111010011011111001001011100010111100111010001000100110110001111001011001111110001110111001011010111010011011111001001011100010111100010110111110100010001001101100011110010110011111100011101110010110101110100110111110010010111000101111001110100010001001101100011110010110011111100011101110010110101110100110111110010010111000101111000101101101011110 e889b1e59f8ee5ae9be4b8bce889b1e59f8ee5ae9be4b8bc5be889b1e59f8ee5ae9be4b8bce889b1e59f8ee5ae9be4b8bc5b5e
UHC 艱城宛?艱城宛?[艱城宛?艱城宛?[^ 11001010110111101110000011110010111010001100100000111111110010101101111011100000111100101110100011001000001111110101101111001010110111101110000011110010111010001100100000111111110010101101111011100000111100101110100011001000001111110101101101011110 cadee0f2e8c83fcadee0f2e8c83f5bcadee0f2e8c83fcadee0f2e8c83f5b5e
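
The rows above can be approximated offline with a short script. This is a minimal sketch, not the tool's actual implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the behavior visible in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    shown = raw.decode(codec)
    # Eight bits per byte, zero-padded, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one row per charset, keyed by the label used in the table."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}
```

Calling `encode_table("艱[^")` returns a dictionary with one `(character, binary, hexadecimal)` tuple per charset. Results for characters at the edge of a charset's repertoire may differ from the tool's, since codec tables vary slightly between implementations.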

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).