To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??H??????????H????????^ 0011111100111111010010000011111100111111001111110011111100111111001111110011111100111111001111110011111101001000001111110011111100111111001111110011111100111111001111110011111101011110 3f3f483f3f3f3f3f3f3f3f3f3f483f3f3f3f3f3f3f3f5e
SJIS-WIN 蚓?H?耕??猥?怨魄蚓?H?耕??猥?怨白^ 111001010110110100111111010010000011111110001101011010110011111100111111111000001100111000111111100010011000010111101001101011101110010101101101001111110100100000111111100011010110101100111111001111111110000011001110001111111000100110000101100101001001001001011110 e56d3f483f8d6b3f3fe0ce3f8985e9aee56d3f483f8d6b3f3fe0ce3f898594925e
EUC-JP 蚓?H饔耕??猥?怨魄蚓?H饔耕??猥?怨白^ 11101001110011100011111101001000100011111110100011101111101110011100110000111111001111111110000011010000001111111011000111100101111100101011000011101001110011100011111101001000100011111110100011101111101110011100110000111111001111111110000011010000001111111011000111100101110001111111001001011110 e9ce3f488fe8efb9cc3f3fe0d03fb1e5f2b0e9ce3f488fe8efb9cc3f3fe0d03fb1e5c7f25e
UTF-8 蚓렚H饔耕렖렕猥렧怨魄蚓렚H饔耕렖렕猥렧怨白^ 111010001001101010010011111010111010000010011010010010001110100110100101100101001110100010000000100101011110101110100000100101101110101110100000100101011110011110001100101001011110101110100000101001111110011010000000101010001110100110101101100001001110100010011010100100111110101110100000100110100100100011101001101001011001010011101000100000001001010111101011101000001001011011101011101000001001010111100111100011001010010111101011101000001010011111100110100000001010100011100111100110011011110101011110 e89a93eba09a48e9a594e88095eba096eba095e78ca5eba0a7e680a8e9ad84e89a93eba09a48e9a594e88095eba096eba095e78ca5eba0a7e680a8e799bd5e
UHC 蚓렚H饔耕렖렕猥렧怨魄蚓렚H饔耕렖렕猥렧怨白^ 11101100111000101000111010101101010010001110100010111101110011001110100110001110101010111000111010101010111010001110010110001110101101101110101010110011110110111101111011101100111000101000111010101101010010001110100010111101110011001110100110001110101010111000111010101010111010001110010110001110101101101110101010110011110110111101110001011110 ece28ead48e8bdcce98eab8eaae8e58eb6eab3dbdeece28ead48e8bdcce98eab8eaae8e58eb6eab3dbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)