To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 豎壽ア嗜R豎壽ア嗜^[豎壽ア嗜R豎壽ア嗜^[^ 1110011010110001100110101110011010110001100110100110111001010010111001101011000110011010111001101011000110011010011011100101111001011011111001101011000110011010111001101011000110011010011011100101001011100110101100011001101011100110101100011001101001101110010111100101101101011110 e6b19ae6b19a6e52e6b19ae6b19a6e5e5be6b19ae6b19a6e52e6b19ae6b19a6e5e5b5e
EUC-JP 豎壽ア嗜R豎壽ア嗜^[豎壽ア嗜R豎壽ア嗜^[^ 111011001011001111010100111010001000111010110001110100111100111101010010111011001011001111010100111010001000111010110001110100111100111101011110010110111110110010110011110101001110100010001110101100011101001111001111010100101110110010110011110101001110100010001110101100011101001111001111010111100101101101011110 ecb3d4e88eb1d3cf52ecb3d4e88eb1d3cf5e5becb3d4e88eb1d3cf52ecb3d4e88eb1d3cf5e5b5e
UTF-8 豎壽ア嗜R豎壽ア嗜^[豎壽ア嗜R豎壽ア嗜^[^ 11101000101100011000111011100101101000111011110111101111101111011011000111100101100101111001110001010010111010001011000110001110111001011010001110111101111011111011110110110001111001011001011110011100010111100101101111101000101100011000111011100101101000111011110111101111101111011011000111100101100101111001110001010010111010001011000110001110111001011010001110111101111011111011110110110001111001011001011110011100010111100101101101011110 e8b18ee5a3bdefbdb1e5979c52e8b18ee5a3bdefbdb1e5979c5e5be8b18ee5a3bdefbdb1e5979c52e8b18ee5a3bdefbdb1e5979c5e5b5e
UHC ?壽?嗜R?壽?嗜^[?壽?嗜R?壽?嗜^[^ 00111111111000011111100000111111110100001110111001010010001111111110000111111000001111111101000011101110010111100101101100111111111000011111100000111111110100001110111001010010001111111110000111111000001111111101000011101110010111100101101101011110 3fe1f83fd0ee523fe1f83fd0ee5e5b3fe1f83fd0ee523fe1f83fd0ee5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)