To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 擁??松??塋??液??耶??耀?ザ???^ 1001011101101001001111110011111110001111101111000011111100111111100110101100100000111111001111111000100101110100001111110011111110010110111010110011111100111111100101110111001100111111100000110101010100111111001111110011111101011110 97693f3f8fbc3f3f9ac83f3f89743f3f96eb3f3f97733f83553f3f3f5e
EUC-JP 擁??松??塋??液??耶??耀?ザ???^ 1100110111001010001111110011111110111110101111100011111100111111110101001100101000111111001111111011000111010101001111110011111111001100111011010011111100111111110011011101010000111111101001011011011000111111001111110011111101011110 cdca3f3fbebe3f3fd4ca3f3fb1d53f3fcced3f3fcdd43fa5b63f3f3f5e
UTF-8 擁쇿췃松덆쪛塋띕졁液ㅸ텤耶잂늿耀붺ザ溜꿴턄^ 11100110100100111000000111101100100001111011111111101100101101111000001111100110100111011011111011101011100011011000011011101100101010101001101111100101101000011000101111101011100111011001010111101100101000011000000111100110101101101011001011100011100001011011100011101101100001011010010011101000100000001011011011101100100111101000001011101011100010101011111111101000100000001000000011101011101101101011101011100011100000101011011011101111101001111000101111101010101111111011010011101101100001001000010001011110 e69381ec87bfecb783e69dbeeb8d86ecaa9be5a18beb9d95eca181e6b6b2e385b8ed85a4e880b6ec9e82eb8abfe88080ebb6bae382b6efa78beabfb4ed84845e
UHC 擁쇿췃松덆쪛塋띕졁液ㅸ텤耶잂늿耀붺ザ溜꿴턄^ 11101000101101101001100111100101101011011001111111100001111001101000100011101001101001011001010011100111101010111011011011101011101000001011001011100100111110111010010011101000101101101001100111100101101011011001111111100010100010001000100011101001101001011001010011100111101010111011011011101010111111101011001011101001101101011010000001011110 e8b699e5ad9fe1e688e9a594e7abb6eba0b2e4fba4e8b699e5ad9fe28888e9a594e7abb6eafeb2e9b5a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)