To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而?莊?而???缺而?袁???莊?而?竣? 100011101010011100111111111001001011010100111111100011101010011100111111001111110011111111100011100111101000111010100111001111111110010111001101001111110011111100111111111001001011010100111111100011101010011100111111100011110111011000111111 8ea73fe4b53f8ea73f3f3fe39e8ea73fe5cd3f3f3fe4b53f8ea73f8f763f
EUC-JP 而?莊?而???缺而?袁???莊?而?竣? 101111001010100100111111111010001011011100111111101111001010100100111111001111110011111111100101111111101011110010101001001111111110101011001111001111110011111100111111111010001011011100111111101111001010100100111111101111011101011100111111 bca93fe8b73fbca93f3f3fe5febca93feacf3f3f3fe8b73fbca93fbdd73f
UTF-8 而렲莊렱而렲欌쇤缺而렲袁얕렫렲莊렱而렲竣렕 111010001000000010001100111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010111001101010110010001100111011001000011110100100111001111011110010111010111010001000000010001100111010111010000010110010111010001010001010000001111011001001011010010101111010111010000010101011111010111010000010110010111010001000111010001010111010111010000010110001111010001000000010001100111010111010000010110010111001111010101110100011111010111010000010010101 e8808ceba0b2e88e8aeba0b1e8808ceba0b2e6ac8cec87a4e7bcbae8808ceba0b2e8a281ec9695eba0abeba0b2e88e8aeba0b1e8808ceba0b2e7aba3eba095
UHC 而렲莊렱而렲欌쇤缺而렲袁얕렫렲莊렱而렲竣렕 111011001011101110001110101111111110110111110110100011101011111011101100101110111000111010111111111011011110101110111100111010011100110011000000111011001011101110001110101111111110101010111110101111101110100010001110101110011000111010111111111011011111011010001110101111101110110010111011100011101011111111110001111000101000111010101010 ecbb8ebfedf68ebeecbb8ebfedebbce9ccc0ecbb8ebfeabebee88eb98ebfedf68ebeecbb8ebff1e28eaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)