To what bit string is a character (or a short run of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 弱?????塢?????絶??節よ?節ョぐE 100011101110001100111111001111110011111100111111001111111001101011000111001111110011111100111111001111110011111110010000111000100011111100111111100100001101111110000010111001100011111110010000110111111000001110000111100000101010111001000101 8ee33f3f3f3f3f9ac73f3f3f3f3f90e23f3f90df82e63f90df838782ae45
EUC-JP 弱??濚??塢?????絶??節よ?節ョぐE 1011110011100101001111110011111110001111110010011010000100111111001111111101010011001001001111110011111100111111001111110011111111000000111001000011111100111111110000001110000110100100111010000011111111000000111000011010010111100111101001001011000001000101 bce53f3f8fc9a13f3fd4c93f3f3f3f3fc0e43f3fc0e1a4e83fc0e1a5e7a4b045
UTF-8 弱놅풎濚믭쉠塢뽳쉥令밧컛絶믥퉺節よ뱰節ョぐE 11100101101111001011000111101011100001101000010111101101100100101000111011100110101111111001101011101011101011111010110111101100100010011010000011100101101000011010001011101011101111011011001111101100100010011010010111101111101001101010100011101011101100001010011111101100101110111001101111100111101101011011011011101011101011111010010111101101100010011011101011100111101011111000000011100011100000101000100011101011101100011011000011100111101011111000000011100011100000111010011111100011100000011001000001000101 e5bcb1eb8685ed928ee6bf9aebafadec89a0e5a1a2ebbdb3ec89a5efa6a8ebb0a7ecbb9be7b5b6ebafa5ed89bae7af80e38288ebb1b0e7af80e383a7e3819045
UHC 弱놅풎濚믭쉠塢뽳쉥令밧컛絶믥퉺節よ뱰節ョぐE 11100101101100001000011011101111101111101001001011100111101110011001001011101111101111011010101011100111111100011001011011101111101111011010101111100111101010011011100111100101101100001000011011101111101111101001001011100111101110011001001011101111101111011010101011101000100100111001011011101111101111011010101111100111101010101011000001000101 e5b086efbe92e7b992efbdaae7f196efbdabe7a9b9e5b086efbe92e7b992efbdaae89396efbdabe7aab045

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).
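
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page. It assumes that Python's `cp932` codec stands in for SJIS-Win and `cp949` for UHC, and that a character a charset cannot represent becomes "?" (0x3f), as in the rows above. The function name `encode_table` and the `CHARSETS` mapping are hypothetical, chosen only for this illustration.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-Win and cp949 ~ UHC; edge cases may differ
# from the converter that produced the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("弱E"):
    print(label, bits, hex_string)
```

For the input "弱E", this sketch gives `8ee345` for SJIS-WIN, `bce545` for EUC-JP, and `e5bcb145` for UTF-8, which agrees with the first and last bytes of the corresponding rows in the table. ISO-8859-1 cannot represent "弱", so it yields `3f45`.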