What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而?莊?而?錚校??莊?而?????莊?而? 10001110101001110011111111100100101101010011111110001110101001110011111111101000010000101000110101011010001111110011111111100100101101010011111110001110101001110011111100111111001111110011111100111111111001001011010100111111100011101010011100111111 8ea73fe4b53f8ea73fe8428d5a3f3fe4b53f8ea73f3f3f3f3fe4b53f8ea73f
EUC-JP 而?莊?而?錚校??莊?而?檉???莊?而? 101111001010100100111111111010001011011100111111101111001010100100111111111011111010001110111001101110110011111100111111111010001011011100111111101111001010100100111111100011111100010110111011001111110011111100111111111010001011011100111111101111001010100100111111 bca93fe8b73fbca93fefa3b9bb3f3fe8b73fbca93f8fc5bb3f3f3fe8b73fbca93f
UTF-8 而렲莊렱而렲錚校렫렲莊렱而렲檉멱렫렲莊렱而렲 111010001000000010001100111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010111010011000110010011010111001101010000010100001111010111010000010101011111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010111001101010101010001001111010111010100110110001111010111010000010101011111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010 e8808ceba0b2e88e8aeba0b1e8808ceba0b2e98c9ae6a0a1eba0abeba0b2e88e8aeba0b1e8808ceba0b2e6aa89eba9b1eba0abeba0b2e88e8aeba0b1e8808ceba0b2
UHC 而렲莊렱而렲錚校렫렲莊렱而렲檉멱렫렲莊렱而렲 1110110010111011100011101011111111101101111101101000111010111110111011001011101110001110101111111110111010110110110011101110100010001110101110011000111010111111111011011111011010001110101111101110110010111011100011101011111111101111111000001011100011101000100011101011100110001110101111111110110111110110100011101011111011101100101110111000111010111111 ecbb8ebfedf68ebeecbb8ebfeeb6cee88eb98ebfedf68ebeecbb8ebfefe0b8e88eb98ebfedf68ebeecbb8ebf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
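
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, as are the helper names to_bit_strings and encode_table. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for illustration.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("而").items():
        print(label, binary, hexadecimal)
```

Calling encode_table with any short string returns one (binary, hexadecimal) pair per charset, in the same order as the rows of the table.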