To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鋗爾鴆竺淲リ鯲タn}鋗爾鴆竺淲リ鯲タn{^ 111110111101000010001110101000101110100111101111100011101011000111111011010001001101100011101001110011001100000001101110011111011111101111010000100011101010001011101001111011111000111010110001111110110100010011011000111010011100110011000000011011100111101101011110 fbd08ea2e9ef8eb1fb44d8e9ccc06e7dfbd08ea2e9ef8eb1fb44d8e9ccc06e7b5e
EUC-JP 鋗爾鴆竺?リ鯲タn}鋗爾鴆竺?リ鯲タn{^ 10001111111001001100000110111100101001001111001011110001101111001011001100111111100011101101100011110010110011101000111011000000011011100111110110001111111001001100000110111100101001001111001011110001101111001011001100111111100011101101100011110010110011101000111011000000011011100111101101011110 8fe4c1bca4f2f1bcb33f8ed8f2ce8ec06e7d8fe4c1bca4f2f1bcb33f8ed8f2ce8ec06e7b5e
UTF-8 鋗爾鴆竺淲リ鯲タn}鋗爾鴆竺淲リ鯲タn{^ 1110100110001011100101111110011110001000101111101110100110110100100001101110011110101011101110101110011010110111101100101110111110111110100110001110100110101111101100101110111110111110100000000110111001111101111010011000101110010111111001111000100010111110111010011011010010000110111001111010101110111010111001101011011110110010111011111011111010011000111010011010111110110010111011111011111010000000011011100111101101011110 e98b97e788bee9b486e7abbae6b7b2efbe98e9afb2efbe806e7de98b97e788bee9b486e7abbae6b7b2efbe98e9afb2efbe806e7b5e
UHC ?爾?竺????n}?爾?竺????n{^ 00111111111011001011001100111111111101011110011100111111001111110011111100111111011011100111110100111111111011001011001100111111111101011110011100111111001111110011111100111111011011100111101101011110 3fecb33ff5e73f3f3f3f6e7d3fecb33ff5e73f3f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)