What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雲?d?巽??齋韓雲?d?巽??齋閒^ 1000100101011111001111111000001010000100001111111001001001000110001111110011111111100010010101101000101011011000100010010101111100111111100000101000010000111111100100100100011000111111001111111110001001010110111110111110100001011110 895f3f82843f92463f3fe2568ad8895f3f82843f92463f3fe256fbe85e
EUC-JP 雲?d?巽庾?齋韓雲?d?巽庾?齋?^ 1011000111000000001111111010001111100100001111111100001110100111100011111011110011001110001111111110001110110111101101001101101010110001110000000011111110100011111001000011111111000011101001111000111110111100110011100011111111100011101101110011111101011110 b1c03fa3e43fc3a78fbcce3fe3b7b4dab1c03fa3e43fc3a78fbcce3fe3b73f5e
UTF-8 雲뜹d뤊巽庾먹齋韓雲뜹d뤊巽庾먹齋閒^ 11101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101011100101101101111011110111100101101110101011111011101011101010001011100111101001101111011000101111101001100111111001001111101001100110111011001011101011100111001011100111101111101111011000010011101011101001001000101011100101101101111011110111100101101110101011111011101011101010001011100111101001101111011000101111101001100101101001001001011110 e99bb2eb9cb9efbd84eba48ae5b7bde5babeeba8b9e9bd8be99f93e99bb2eb9cb9efbd84eba48ae5b7bde5babeeba8b9e9bd8be996925e
UHC 雲뜹d뤊巽庾먹齋韓雲뜹d뤊巽庾먹齋閒^ 11101010101000111011011011100101101000111110010010001111101110101110000111011110111010101110110010111000110101001110111010110001111110011101101111101010101000111011011011100101101000111110010010001111101110101110000111011110111010101110110010111000110101001110111010110001111110011101100101011110 eaa3b6e5a3e48fbae1deeaecb8d4eeb1f9dbeaa3b6e5a3e48fbae1deeaecb8d4eeb1f9d95e

SJIS-Win, EUC-JP: Classic charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
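
The conversion in the table can be approximated with a short Python sketch. How the tool itself is implemented is not shown here, so everything below is an assumption: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949), and the helper names encode_row and encode_table, which are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("雲^").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.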