To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰???ъ?筌??逾g?誘??依?癲 111000011001111100111111001111111110000010100111001111110011111100111111100001001000110000111111111000101010001100111111001111111110011110100101100000101000011100111111100101110101010100111111001111111000100011001011001111111110000110011111 e19f3f3fe0a73f3f3f848c3fe2a33f3fe7a582873f97553f3f88cb3fe19f
EUC-JP 癲??爰??蓀ъ?筌??逾g?誘??依?癲 1110001010100001001111110011111111100000101010010011111100111111100011111101100011111000101001111110110000111111111001001010010100111111001111111110111010100111101000111110011100111111110011011011011000111111001111111011000011001101001111111110001010100001 e2a13f3fe0a93f3f8fd8f8a7ec3fe4a53f3feea7a3e73fcdb63f3fb0cd3fe2a1
UTF-8 癲ㅺ퓭爰귝끽蓀ъ젘筌뚯궡逾g춯誘띾뿫依췆癲 1110011110011001101100101110001110000101101110101110110110010011101011011110011110001000101100001110101010110111100111011110101110000001101111011110100010010011100000001101000110001010111011001010000010011000111001111010110110001100111010111001101010101111111010101011011010100001111010011000000010111110111011111011110110000111111011001011011010101111111010001010101010011000111010111001110110111110111010111011111110101011111001001011111010011101111011001011011110000110111001111001100110110010 e799b2e385baed93ade788b0eab79deb81bde89380d18aeca098e7ad8ceb9aafeab6a1e980beefbd87ecb6afe8aa98eb9dbeebbfabe4be9decb786e799b2
UHC 癲ㅺ퓭爰귝끽蓀ъ젘筌뚯궡逾g춯誘띾뿫依췆癲 111011111010011010100100111010101011111110010100111010101011101010000010111001101011001110100011111000011110000010101100111011001010000010010100111011111010011110001100111011001000001010110100111010111011010110100011111001111010110110001100111010111010111110001101111010111001011110101011111010111110111010101110010000011110111110100110 efa6a4eabf94eaba82e6b3a3e1e0aceca094efa78cec82b4ebb5a3e7ad8cebaf8deb97abebeeae41efa6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)