Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臍?l肢?臍?嶼堤拒臍?l肢?臍?嶼堤居^ 1110010001100000001111111000001010001100100011101000100000111111111001000110000000111111100110111101011110010010111001111000101110010001111001000110000000111111100000101000110010001110100010000011111111100100011000000011111110011011110101111001001011100111100010111000111101011110 e4603f828c8e883fe4603f9bd792e78b91e4603f828c8e883fe4603f9bd792e78b8f5e
EUC-JP 臍?l肢?臍?嶼堤拒臍?l肢?臍?嶼堤居^ 1110011111000001001111111010001111101100101110111110100000111111111001111100000100111111110101101101100111000100111010011011010111110001111001111100000100111111101000111110110010111011111010000011111111100111110000010011111111010110110110011100010011101001101101011110111101011110 e7c13fa3ecbbe83fe7c13fd6d9c4e9b5f1e7c13fa3ecbbe83fe7c13fd6d9c4e9b5ef5e
UTF-8 臍멨l肢렗臍멧嶼堤拒臍멨l肢렗臍멧嶼堤居^ 11101000100001111000110111101011101010011010100011101111101111011000110011101000100000101010001011101011101000001001011111101000100001111000110111101011101010011010011111100101101101101011110011100101101000001010010011100110100010111001001011101000100001111000110111101011101010011010100011101111101111011000110011101000100000101010001011101011101000001001011111101000100001111000110111101011101010011010011111100101101101101011110011100101101000001010010011100101101100011000010101011110 e8878deba9a8efbd8ce882a2eba097e8878deba9a7e5b6bce5a0a4e68b92e8878deba9a8efbd8ce882a2eba097e8878deba9a7e5b6bce5a0a4e5b1855e
UHC 臍멨l肢렗臍멧嶼堤拒臍멨l肢렗臍멧嶼堤居^ 1111000010110000101110001110010110100011111011001111001010110110100011101010110011110000101100001011100011100100110111111110110011110000101001111100101111011110111100001011000010111000111001011010001111101100111100101011011010001110101011001111000010110000101110001110010011011111111011001111000010100111110010111101110001011110 f0b0b8e5a3ecf2b68eacf0b0b8e4dfecf0a7cbdef0b0b8e5a3ecf2b68eacf0b0b8e4dfecf0a7cbdc5e

SJIS-Win, EUC-JP: legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
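
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that the page's labels correspond to Python's codec names (`cp932` for SJIS-WIN, `cp949` for UHC), and that characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Mapping from the page's charset labels to Python codec names.
# Assumption: these codecs behave like the converter's charsets.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary bit string, hexadecimal string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("^").items():
        print(label, bits, hexstr)
```

Calling `encode_table` with the same input as the page should give comparable rows, although individual codecs may differ from the converter for vendor-specific characters.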