To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 庸??媛??癒レ?獄??愿??懿??域 100101110110011000111111001111111001010101010001001111110011111110010110111111001000001110001100001111111000110110010110001111110011111110011100110000110011111100111111100111001111001000111111001111111000100011100110 97663f3f95513f3f96fc838c3f8d963f3f9cc33f3f9cf23f3f88e6
EUC-JP 庸??媛??癒レ?獄??愿??懿??域 110011011100011100111111001111111100100110110010001111110011111111001100111111101010010111101100001111111011100111110110001111110011111111011000110001010011111100111111110110001111010000111111001111111011000011101000 cdc73f3fc9b23f3fccfea5ec3fb9f63f3fd8c53f3fd8f43f3fb0e8
UTF-8 庸뉗빖媛쇿푻癒レ칮獄쏄퉫愿닻펶懿멸컩域 111001011011101010111000111010111000100110010111111010111011100110010110111001011010101010011011111011001000011110111111111011011001000110111011111001111001100110010010111000111000001110101100111011001011100110101110111001111000110110000100111011001000111110000100111011011000100110101011111001101000010010111111111010111000101110111011111011011000111010110110111001101000011110111111111010111010100110111000111011001011101110101001111001011001111110011111 e5bab8eb8997ebb996e5aa9bec87bfed91bbe79992e383acecb9aee78d84ec8f84ed89abe684bfeb8bbbed8eb6e687bfeba9b8ecbba9e59f9f
UHC 庸뉗빖媛쇿푻癒レ칮獄쏄퉫愿닻펶懿멸컩域 1110100110111100100001111110110010010101101110001110101010110000100110011110010110111110100001111110101110101000101010111110110010101111100000011110100010101011100110111110101010111001100000111110101010110100101101001110100110111100100001111110101111110011101110001110101010110000100100011110011010110100 e9bc87ec95b8eab099e5be87eba8abecaf81e8ab9beab983eab4b4e9bc87ebf3b8eab091e6b4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)