To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN 亨h?nR亨h?n^[亨h?nR亨h?n^[^ 10001011100111001000001010001000001111110110111001010010100010111001110010000010100010000011111101101110010111100101101110001011100111001000001010001000001111110110111001010010100010111001110010000010100010000011111101101110010111100101101101011110 8b9c82883f6e528b9c82883f6e5e5b8b9c82883f6e528b9c82883f6e5e5b5e
EUC-JP 亨h?nR亨h?n^[亨h?nR亨h?n^[^ 10110101111111001010001111101000001111110110111001010010101101011111110010100011111010000011111101101110010111100101101110110101111111001010001111101000001111110110111001010010101101011111110010100011111010000011111101101110010111100101101101011110 b5fca3e83f6e52b5fca3e83f6e5e5bb5fca3e83f6e52b5fca3e83f6e5e5b5e
UTF-8 亨h나nR亨h나n^[亨h나nR亨h나n^[^ 1110010010111010101010001110111110111101100010001110101110000010100110000110111001010010111001001011101010101000111011111011110110001000111010111000001010011000011011100101111001011011111001001011101010101000111011111011110110001000111010111000001010011000011011100101001011100100101110101010100011101111101111011000100011101011100000101001100001101110010111100101101101011110 e4baa8efbd88eb82986e52e4baa8efbd88eb82986e5e5be4baa8efbd88eb82986e52e4baa8efbd88eb82986e5e5b5e
UHC 亨h나nR亨h나n^[亨h나nR亨h나n^[^ 1111101011111011101000111110100010110011101010100110111001010010111110101111101110100011111010001011001110101010011011100101111001011011111110101111101110100011111010001011001110101010011011100101001011111010111110111010001111101000101100111010101001101110010111100101101101011110 fafba3e8b3aa6e52fafba3e8b3aa6e5e5bfafba3e8b3aa6e52fafba3e8b3aa6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)