To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雲?d?巽??齋賞雲?d?巽??齋詳^ 1000100101011111001111111000001010000100001111111001001001000110001111110011111111100010010101101000111111011100100010010101111100111111100000101000010000111111100100100100011000111111001111111110001001010110100011111101101001011110 895f3f82843f92463f3fe2568fdc895f3f82843f92463f3fe2568fda5e
EUC-JP 雲?d?巽庾?齋賞雲?d?巽庾?齋詳^ 101100011100000000111111101000111110010000111111110000111010011110001111101111001100111000111111111000111011011110111110110111101011000111000000001111111010001111100100001111111100001110100111100011111011110011001110001111111110001110110111101111101101110001011110 b1c03fa3e43fc3a78fbcce3fe3b7bedeb1c03fa3e43fc3a78fbcce3fe3b7bedc5e
UTF-8 雲뜹d뤊巽庾먹齋賞雲뜹d뤊巽庾먹齋詳^ 11101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101011100101101101111011110111100101101110101011111011101011101010001011100111101001101111011000101111101000101100111001111011101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101011100101101101111011110111100101101110101011111011101011101010001011100111101001101111011000101111101000101010011011001101011110 e99bb2eb9cb9efbd84eba48ae5b7bde5babeeba8b9e9bd8be8b39ee99bb2eb9cb9efbd84eba48ae5b7bde5babeeba8b9e9bd8be8a9b35e
UHC 雲뜹d뤊巽庾먹齋賞雲뜹d뤊巽庾먹齋詳^ 11101010101000111011011011100101101000111110010010001111101110101110000111011110111010101110110010111000110101001110111010110001110111111101101111101010101000111011011011100101101000111110010010001111101110101110000111011110111010101110110010111000110101001110111010110001110111111101100101011110 eaa3b6e5a3e48fbae1deeaecb8d4eeb1dfdbeaa3b6e5a3e48fbae1deeaecb8d4eeb1dfd95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)