To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?C?nf?C?n^}Y?C?nf?C?n^}bE 00111111010000110011111101101110011001100011111101000011001111110110111001011110011111010101100100111111010000110011111101101110011001100011111101000011001111110110111001011110011111010110001001000101 3f433f6e663f433f6e5e7d593f433f6e663f433f6e5e7d6245
SJIS-WIN 栖C惑nf栖C惑n^}Y栖C惑nf栖C惑n^}bE 100100001011001001000011100110000110011001101110011001101001000010110010010000111001100001100110011011100101111001111101010110011001000010110010010000111001100001100110011011100110011010010000101100100100001110011000011001100110111001011110011111010110001001000101 90b24398666e6690b24398666e5e7d5990b24398666e6690b24398666e5e7d6245
EUC-JP 栖C惑nf栖C惑n^}Y栖C惑nf栖C惑n^}bE 110000001011010001000011110011111100011101101110011001101100000010110100010000111100111111000111011011100101111001111101010110011100000010110100010000111100111111000111011011100110011011000000101101000100001111001111110001110110111001011110011111010110001001000101 c0b443cfc76e66c0b443cfc76e5e7d59c0b443cfc76e66c0b443cfc76e5e7d6245
UTF-8 栖C惑nf栖C惑n^}Y栖C惑nf栖C惑n^}bE 1110011010100000100101100100001111100110100000111001000101101110011001101110011010100000100101100100001111100110100000111001000101101110010111100111110101011001111001101010000010010110010000111110011010000011100100010110111001100110111001101010000010010110010000111110011010000011100100010110111001011110011111010110001001000101 e6a09643e683916e66e6a09643e683916e5e7d59e6a09643e683916e66e6a09643e683916e5e7d6245
UHC 栖C惑nf栖C惑n^}Y栖C惑nf栖C惑n^}bE 110111111111011101000011111110111110001101101110011001101101111111110111010000111111101111100011011011100101111001111101010110011101111111110111010000111111101111100011011011100110011011011111111101110100001111111011111000110110111001011110011111010110001001000101 dff743fbe36e66dff743fbe36e5e7d59dff743fbe36e66dff743fbe36e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)