To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 日?而?恁?而???缺而?才?而?莊? 10010011111110100011111110001110101001110011111110011100100011000011111110001110101001110011111100111111001111111110001110011110100011101010011100111111100011011100101100111111100011101010011100111111111001001011010100111111 93fa3f8ea73f9c8c3f8ea73f3f3fe39e8ea73f8dcb3f8ea73fe4b53f
EUC-JP 日?而?恁?而?佾?缺而?才?而?莊? 110001101111110000111111101111001010100100111111110101111110110000111111101111001010100100111111100011111011000011111011001111111110010111111110101111001010100100111111101110101100110100111111101111001010100100111111111010001011011100111111 c6fc3fbca93fd7ec3fbca93f8fb0fb3fe5febca93fbacd3fbca93fe8b73f
UTF-8 日렮而렲恁렱而렲佾쇤缺而렲才렱而렲莊렱 111001101001011110100101111010111010000010101110111010001000000010001100111010111010000010110010111001101000000110000001111010111010000010110001111010001000000010001100111010111010000010110010111001001011110110111110111011001000011110100100111001111011110010111010111010001000000010001100111010111010000010110010111001101000100110001101111010111010000010110001111010001000000010001100111010111010000010110010111010001000111010001010111010111010000010110001 e697a5eba0aee8808ceba0b2e68181eba0b1e8808ceba0b2e4bdbeec87a4e7bcbae8808ceba0b2e6898deba0b1e8808ceba0b2e88e8aeba0b1
UHC 日렮而렲恁렱而렲佾쇤缺而렲才렱而렲莊렱 1110110011101101100011101011101111101100101110111000111010111111111011001111011010001110101111101110110010111011100011101011111111101100111010111011110011101001110011001100000011101100101110111000111010111111111011101010011010001110101111101110110010111011100011101011111111101101111101101000111010111110 eced8ebbecbb8ebfecf68ebeecbb8ebfecebbce9ccc0ecbb8ebfeea68ebeecbb8ebfedf68ebe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)