What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????M???????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111010011010011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f3f3f3f
SJIS-WIN 偲鴫篠質篠シナクナ悉M篠シト執篠湿篠質 10001110110000111000111010110000100011101100001010001110101111111000111011000010101111001100010110111000110001011000111010111011010011011000111011000010101111001100010010001110101101111000111011000010100011101011110010001110110000101000111010111111 8ec38eb08ec28ebf8ec2bcc5b8c58ebb4d8ec2bcc48eb78ec28ebc8ec28ebf
EUC-JP 偲鴫篠質篠シナクナ悉M篠シト執篠湿篠質 10111100110001011011110010110010101111001100010010111100110000011011110011000100100011101011110010001110110001011000111010111000100011101100010110111100101111010100110110111100110001001000111010111100100011101100010010111100101110011011110011000100101111001011111010111100110001001011110011000001 bcc5bcb2bcc4bcc1bcc48ebc8ec58eb88ec5bcbd4dbcc48ebc8ec4bcb9bcc4bcbebcc4bcc1
UTF-8 偲鴫篠質篠シナクナ悉M篠シト執篠湿篠質 11100101100000011011001011101001101101001010101111100111101011111010000011101000101100111010101011100111101011111010000011101111101111011011110011101111101111101000010111101111101111011011100011101111101111101000010111100110100000101000100101001101111001111010111110100000111011111011110110111100111011111011111010000100111001011001111110110111111001111010111110100000111001101011100110111111111001111010111110100000111010001011001110101010 e581b2e9b4abe7afa0e8b3aae7afa0efbdbcefbe85efbdb8efbe85e682894de7afa0efbdbcefbe84e59fb7e7afa0e6b9bfe7afa0e8b3aa
UHC ??篠質篠????悉M篠??執篠?篠質 00111111001111111110000111000110111100101111010111100001110001100011111100111111001111110011111111100011111110100100110111100001110001100011111100111111111100101111101111100001110001100011111111100001110001101111001011110101 3f3fe1c6f2f5e1c63f3f3f3fe3fa4de1c63f3ff2fbe1c63fe1c6f2f5

SJIS-Win, EUC-JP: Legacy character sets mainly used for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
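
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The function name `encode_report` is hypothetical, and the codec names (`cp932` for SJIS-Win, `euc_jp`, `cp949` for UHC) are assumed to be the closest Python equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (byte 3f), which mirrors the ISO-8859-1 and UHC rows.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for each charset.

    Unencodable characters are replaced with '?' (0x3f), an assumption
    chosen to resemble the '?' entries in the table above.
    """
    rows = []
    for name in charsets:
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("M"):
        print(name, bits, hexstr)
```

For the single ASCII letter `M`, every charset in the list yields the same byte, 4d (01001101), which matches the `M` in the middle of each row of the table.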