To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 闡ッム餓勧譛餓カ殉闡ッム餓勧譛餓カ旬^ 11101000100100011010111111010001100010011110110010001010101010011110011010011100100010011110110010110110100011110111110111101000100100011010111111010001100010011110110010001010101010011110011010011100100010011110110010110110100011110111101101011110 e891afd189ec8aa9e69c89ecb68f7de891afd189ec8aa9e69c89ecb68f7b5e
EUC-JP 闡ッム餓勧譛餓カ殉闡ッム餓勧譛餓カ旬^ 11101111111100011000111010101111100011101101000110110010111011101011010010101011111010111111110010110010111011101000111010110110101111011101111011101111111100011000111010101111100011101101000110110010111011101011010010101011111010111111110010110010111011101000111010110110101111011101110001011110 eff18eaf8ed1b2eeb4abebfcb2ee8eb6bddeeff18eaf8ed1b2eeb4abebfcb2ee8eb6bddc5e
UTF-8 闡ッム餓勧譛餓カ殉闡ッム餓勧譛餓カ旬^ 11101001100101111010000111101111101111011010111111101111101111101001000111101001101001001001001111100101100010111010011111101000101011011001101111101001101001001001001111101111101111011011011011100110101011101000100111101001100101111010000111101111101111011010111111101111101111101001000111101001101001001001001111100101100010111010011111101000101011011001101111101001101001001001001111101111101111011011011011100110100101111010110001011110 e997a1efbdafefbe91e9a493e58ba7e8ad9be9a493efbdb6e6ae89e997a1efbdafefbe91e9a493e58ba7e8ad9be9a493efbdb6e697ac5e
UHC 闡??餓??餓?殉闡??餓??餓?旬^ 111101001100010100111111001111111110010010111011001111110011111111100100101110110011111111100010111001101111010011000101001111110011111111100100101110110011111100111111111001001011101100111111111000101110001001011110 f4c53f3fe4bb3f3fe4bb3fe2e6f4c53f3fe4bb3f3fe4bb3fe2e25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)