To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而?莊?而?齋槃缺而?圓???莊?而? 1000111010100111001111111110010010110101001111111000111010100111001111111110001001010110100111101100111111100011100111101000111010100111001111111001101010100010001111110011111100111111111001001011010100111111100011101010011100111111 8ea73fe4b53f8ea73fe2569ecfe39e8ea73f9aa23f3f3fe4b53f8ea73f
EUC-JP 而?莊?而?齋槃缺而?圓???莊?而? 1011110010101001001111111110100010110111001111111011110010101001001111111110001110110111110111001101000111100101111111101011110010101001001111111101010010100100001111110011111100111111111010001011011100111111101111001010100100111111 bca93fe8b73fbca93fe3b7dcd1e5febca93fd4a43f3f3fe8b73fbca93f
UTF-8 而렲莊렱而렲齋槃缺而렲圓꿱렫렲莊렱而렲 111010001000000010001100111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010111010011011110110001011111001101010011110000011111001111011110010111010111010001000000010001100111010111010000010110010111001011001110010010011111010101011111110110001111010111010000010101011111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010 e8808ceba0b2e88e8aeba0b1e8808ceba0b2e9bd8be6a783e7bcbae8808ceba0b2e59c93eabfb1eba0abeba0b2e88e8aeba0b1e8808ceba0b2
UHC 而렲莊렱而렲齋槃缺而렲圓꿱렫렲莊렱而렲 1110110010111011100011101011111111101101111101101000111010111110111011001011101110001110101111111110111010110001110110101110100111001100110000001110110010111011100011101011111111101010101011011011001011101000100011101011100110001110101111111110110111110110100011101011111011101100101110111000111010111111 ecbb8ebfedf68ebeecbb8ebfeeb1dae9ccc0ecbb8ebfeaadb2e88eb98ebfedf68ebeecbb8ebf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)