What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 而?齋槃??蒸?n}而?齋槃??蒸?n{^ 1000111010100111001111111110001001010110100111101100111100111111001111111000111111110110001111110110111001111101100011101010011100111111111000100101011010011110110011110011111100111111100011111111011000111111011011100111101101011110 8ea73fe2569ecf3f3f8ff63f6e7d8ea73fe2569ecf3f3f8ff63f6e7b5e
EUC-JP 而?齋槃??蒸?n}而?齋槃??蒸?n{^ 1011110010101001001111111110001110110111110111001101000100111111001111111011111011111000001111110110111001111101101111001010100100111111111000111011011111011100110100010011111100111111101111101111100000111111011011100111101101011110 bca93fe3b7dcd13f3fbef83f6e7dbca93fe3b7dcd13f3fbef83f6e7b5e
UTF-8 而렲齋槃롚렢蒸렒n}而렲齋槃롚렢蒸렒n{^ 1110100010000000100011001110101110100000101100101110100110111101100010111110011010100111100000111110101110100001100110101110101110100000101000101110100010010010101110001110101110100000100100100110111001111101111010001000000010001100111010111010000010110010111010011011110110001011111001101010011110000011111010111010000110011010111010111010000010100010111010001001001010111000111010111010000010010010011011100111101101011110 e8808ceba0b2e9bd8be6a783eba19aeba0a2e892b8eba0926e7de8808ceba0b2e9bd8be6a783eba19aeba0a2e892b8eba0926e7b5e
UHC 而렲齋槃롚렢蒸렒n}而렲齋槃롚렢蒸렒n{^ 11101100101110111000111010111111111011101011000111011010111010011000111011011110100011101011001111110001111110101000111010100111011011100111110111101100101110111000111010111111111011101011000111011010111010011000111011011110100011101011001111110001111110101000111010100111011011100111101101011110 ecbb8ebfeeb1dae98ede8eb3f1fa8ea76e7decbb8ebfeeb1dae98ede8eb3f1fa8ea76e7b5e
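The conversion shown in the table can be approximated with a minimal Python sketch. This is not the tool's own code. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed to be the closest Python equivalents of the charsets listed, and `encode_row` is a hypothetical helper. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "而"  # first character of the table's input
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For the single character 而, the UTF-8 row of this sketch yields `e8808c`, which matches the first three bytes of the UTF-8 row in the table.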

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).