To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???i???iB 001111110011111100111111011010010011111100111111001111110110100101000010 3f3f3f693f3f3f6942
SJIS-WIN 肛雁害i肛雁害iB 111000111110100010001010111001011000101001010001011010011110001111101000100010101110010110001010010100010110100101000010 e3e88ae58a5169e3e88ae58a516942
EUC-JP 肛雁害i肛雁害iB 111001101110101010110100111001111011001110110010011010011110011011101010101101001110011110110011101100100110100101000010 e6eab4e7b3b269e6eab4e7b3b26942
UTF-8 肛雁害i肛雁害iB 111010001000001010011011111010011001101110000001111001011010111010110011011010011110100010000010100110111110100110011011100000011110010110101110101100110110100101000010 e8829be99b81e5aeb369e8829be99b81e5aeb36942
UHC 肛雁害i肛雁害iB 111110011111110111100100110100101111101010101010011010011111100111111101111001001101001011111010101010100110100101000010 f9fde4d2faaa69f9fde4d2faaa6942

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
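
The rows in the table above can be approximated offline with a short script. The following is a minimal sketch, not the converter's actual implementation. It assumes that Python's built-in codecs `cp932` and `cp949` correspond to the SJIS-WIN and UHC labels, and that characters a charset cannot represent are replaced with `?` (which matches the ISO-8859-1 row). The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note above states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters that the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("iB").items():
        print(label, bits, hex_string)
```

Because `i` and `B` are ASCII characters, every charset in this sketch encodes `iB` to the same two bytes, `69 42`. The encodings only diverge for characters outside ASCII.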