To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 蒸?義???畯?n}蒸?義???畯?n{^ 100011111111011000111111100010110110000000111111001111110011111111111011011011110011111101101110011111011000111111110110001111111000101101100000001111110011111100111111111110110110111100111111011011100111101101011110 8ff63f8b603f3f3ffb6f3f6e7d8ff63f8b603f3f3ffb6f3f6e7b5e
EUC-JP 蒸?義???畯?n}蒸?義???畯?n{^ 1011111011111000001111111011010111000001001111110011111100111111100011111100110110111011001111110110111001111101101111101111100000111111101101011100000100111111001111110011111110001111110011011011101100111111011011100111101101011110 bef83fb5c13f3f3f8fcdbb3f6e7dbef83fb5c13f3f3f8fcdbb3f6e7b5e
UTF-8 蒸렟義얹렰렖畯둔n}蒸렟義얹렰렖畯둔n{^ 1110100010010010101110001110101110100000100111111110011110111110101010011110110010010110101110011110101110100000101100001110101110100000100101101110011110010101101011111110101110010001100101000110111001111101111010001001001010111000111010111010000010011111111001111011111010101001111011001001011010111001111010111010000010110000111010111010000010010110111001111001010110101111111010111001000110010100011011100111101101011110 e892b8eba09fe7bea9ec96b9eba0b0eba096e795afeb91946e7de892b8eba09fe7bea9ec96b9eba0b0eba096e795afeb91946e7b5e
UHC 蒸렟義얹렰렖畯둔n}蒸렟義얹렰렖畯둔n{^ 11110001111110101000111010110000111010111111100110111110111100011000111010111101100011101010101111110001111000011011010111010000011011100111110111110001111110101000111010110000111010111111100110111110111100011000111010111101100011101010101111110001111000011011010111010000011011100111101101011110 f1fa8eb0ebf9bef18ebd8eabf1e1b5d06e7df1fa8eb0ebf9bef18ebd8eabf1e1b5d06e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)