To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????n}???????????n{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 肛?虞?畯驀???逗?n}肛?虞?畯驀???逗?n{^ 11100011111010000011111110001011111100010011111111111011011011111110100101111101001111110011111100111111100100001000000000111111011011100111110111100011111010000011111110001011111100010011111111111011011011111110100101111101001111110011111100111111100100001000000000111111011011100111101101011110 e3e83f8bf13ffb6fe97d3f3f3f90803f6e7de3e83f8bf13ffb6fe97d3f3f3f90803f6e7b5e
EUC-JP 肛?虞?畯驀??祜逗?n}肛?虞?畯驀??祜逗?n{^ 11100110111010100011111110110110111100110011111110001111110011011011101111110001110111100011111100111111100011111101000011011000101111111110000000111111011011100111110111100110111010100011111110110110111100110011111110001111110011011011101111110001110111100011111100111111100011111101000011011000101111111110000000111111011011100111101101011110 e6ea3fb6f33f8fcdbbf1de3f3f8fd0d8bfe03f6e7de6ea3fb6f33f8fcdbbf1de3f3f8fd0d8bfe03f6e7b5e
UTF-8 肛렚虞렧畯驀렖렕祜逗백n}肛렚虞렧畯驀렖렕祜逗백n{^ 1110100010000010100110111110101110100000100110101110100010011001100111101110101110100000101001111110011110010101101011111110100110101001100000001110101110100000100101101110101110100000100101011110011110100101100111001110100110000000100101111110101110110000101100010110111001111101111010001000001010011011111010111010000010011010111010001001100110011110111010111010000010100111111001111001010110101111111010011010100110000000111010111010000010010110111010111010000010010101111001111010010110011100111010011000000010010111111010111011000010110001011011100111101101011110 e8829beba09ae8999eeba0a7e795afe9a980eba096eba095e7a59ce98097ebb0b16e7de8829beba09ae8999eeba0a7e795afe9a980eba096eba095e7a59ce98097ebb0b16e7b5e
UHC 肛렚虞렧畯驀렖렕祜逗백n}肛렚虞렧畯驀렖렕祜逗백n{^ 11111001111111011000111010101101111010011110010110001110101101101111000111100001110110001110100110001110101010111000111010101010111110111101010011010100111010001011100111101001011011100111110111111001111111011000111010101101111010011110010110001110101101101111000111100001110110001110100110001110101010111000111010101010111110111101010011010100111010001011100111101001011011100111101101011110 f9fd8eade9e58eb6f1e1d8e98eab8eaafbd4d4e8b9e96e7df9fd8eade9e58eb6f1e1d8e98eab8eaafbd4d4e8b9e96e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)