To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 將?障?除???衣私????怨封?絅??? 1001101110010010001111111000111111100001001111111000111110011100001111110011111100111111100010001101111110001110100001000011111100111111001111110011111110001001100001011001010110010101001111111110001101000100001111110011111100111111 9b923f8fe13f8f9c3f3f3f88df8e843f3f3f3f898595953fe3443f3f3f
EUC-JP 將?障?除???衣私????怨封?絅??焌 11010101111100100011111110111110111000110011111110111101111111000011111100111111001111111011000011100001101110111110010000111111001111110011111100111111101100011110010111001001111101010011111111100101101001010011111100111111100011111100100111101000 d5f23fbee33fbdfc3f3f3fb0e1bbe43f3f3f3fb1e5c9f53fe5a53f3f8fc9e8
UTF-8 將렚障렚除곁렩렰衣私렟닿렱렲怨封렮絅렏렕焌 111001011011000010000111111010111010000010011010111010011001101010011100111010111010000010011010111010011001100110100100111010101011001110000001111010111010000010101001111010111010000010110000111010001010000110100011111001111010011110000001111010111010000010011111111010111000101110111111111010111010000010110001111010111010000010110010111001101000000010101000111001011011000010000001111010111010000010101110111001111011010110000101111010111010000010001111111010111010000010010101111001111000010010001100 e5b087eba09ae99a9ceba09ae999a4eab381eba0a9eba0b0e8a1a3e7a781eba09feb8bbfeba0b1eba0b2e680a8e5b081eba0aee7b585eba08feba095e7848c
UHC 將렚障렚除곁렩렰衣私렟닿렱렲怨封렮絅렏렕焌 111011011110001010001110101011011110111010100001100011101010110111110000101101101011000011100111100011101011011110001110101111011110101111111101110111101110011110001110101100001011010011101010100011101011111010001110101111111110101010110011110111001110011010001110101110111100110011100111100011101010010110001110101010101111000111100000 ede28eadeea18eadf0b6b0e78eb78ebdebfddee78eb0b4ea8ebe8ebfeab3dce68ebbcce78ea58eaaf1e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)