What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????????彧???????????彧??^ 001111110011111100111111001111110011111100111111001111110011111100111111111110101011100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111111101010111001001111110011111101011110 3f3f3f3f3f3f3f3f3ffab93f3f3f3f3f3f3f3f3f3f3ffab93f3f5e
EUC-JP ?????????彧???????????彧??^ 0011111100111111001111110011111100111111001111110011111100111111001111111000111110111100111111100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100011111011110011111110001111110011111101011110 3f3f3f3f3f3f3f3f3f8fbcfe3f3f3f3f3f3f3f3f3f3f3f8fbcfe3f3f5e
UTF-8 알렚렯렶알렖렯렟않彧롕솰알렚렯렶알렖렯렟않彧롕솩^ 11101100100101011000110011101011101000001001101011101011101000001010111111101011101000001011011011101100100101011000110011101011101000001001011011101011101000001010111111101011101000001001111111101100100101011000101011100101101111011010011111101011101000011001010111101100100001101011000011101100100101011000110011101011101000001001101011101011101000001010111111101011101000001011011011101100100101011000110011101011101000001001011011101011101000001010111111101011101000001001111111101100100101011000101011100101101111011010011111101011101000011001010111101100100001101010100101011110 ec958ceba09aeba0afeba0b6ec958ceba096eba0afeba09fec958ae5bda7eba195ec86b0ec958ceba09aeba0afeba0b6ec958ceba096eba0afeba09fec958ae5bda7eba195ec86a95e
UHC 알렚렯렶알렖렯렟않彧롕솰알렚렯렶알렖렯렟않彧롕솩^ 10111110110010111000111010101101100011101011110010001110110000011011111011001011100011101010101110001110101111001000111010110000101111101100101011101001111011101000111011011001101111001110000010111110110010111000111010101101100011101011110010001110110000011011111011001011100011101010101110001110101111001000111010110000101111101100101011101001111011101000111011011001101111001101111001011110 becb8ead8ebc8ec1becb8eab8ebc8eb0becae9ee8ed9bce0becb8ead8ebc8ec1becb8eab8ebc8eb0becae9ee8ed9bcde5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (3f) in the table marks a character that the charset cannot represent.
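
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation, which is not known. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the function names `to_bit_strings` and `convert` are assumptions made for illustration. Unencodable characters are replaced with "?" to mimic the table, and Python's codec tables may differ from the converter's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def to_bit_strings(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("알^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.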