To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雲?d?懇??齋砧雲?d?懇??齋浸^ 1000100101011111001111111000001010000100001111111000110110100111001111110011111111100010010101101000101101101101100010010101111100111111100000101000010000111111100011011010011100111111001111111110001001010110100100000101101001011110 895f3f82843f8da73f3fe2568b6d895f3f82843f8da73f3fe256905a5e
EUC-JP 雲?d?懇庾?齋砧雲?d?懇庾?齋浸^ 101100011100000000111111101000111110010000111111101110101010100110001111101111001100111000111111111000111011011110110101110011101011000111000000001111111010001111100100001111111011101010101001100011111011110011001110001111111110001110110111101111111011101101011110 b1c03fa3e43fbaa98fbcce3fe3b7b5ceb1c03fa3e43fbaa98fbcce3fe3b7bfbb5e
UTF-8 雲뜹d뤋懇庾먹齋砧雲뜹d뤋懇庾먹齋浸^ 11101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101111100110100001111000011111100101101110101011111011101011101010001011100111101001101111011000101111100111101000001010011111101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101111100110100001111000011111100101101110101011111011101011101010001011100111101001101111011000101111100110101101011011100001011110 e99bb2eb9cb9efbd84eba48be68787e5babeeba8b9e9bd8be7a0a7e99bb2eb9cb9efbd84eba48be68787e5babeeba8b9e9bd8be6b5b85e
UHC 雲뜹d뤋懇庾먹齋砧雲뜹d뤋懇庾먹齋浸^ 11101010101000111011011011100101101000111110010010001111101110111100101011010000111010101110110010111000110101001110111010110001111101101101101111101010101000111011011011100101101000111110010010001111101110111100101011010000111010101110110010111000110101001110111010110001111101101101100101011110 eaa3b6e5a3e48fbbcad0eaecb8d4eeb1f6dbeaa3b6e5a3e48fbbcad0eaecb8d4eeb1f6d95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)