To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 悠?杖?悠?肢?n}悠?杖?悠?肢?n{^ 1001011101001001001111111000111111110001001111111001011101001001001111111000111010001000001111110110111001111101100101110100100100111111100011111111000100111111100101110100100100111111100011101000100000111111011011100111101101011110 97493f8ff13f97493f8e883f6e7d97493f8ff13f97493f8e883f6e7b5e
EUC-JP 悠?杖?悠?肢?n}悠?杖?悠?肢?n{^ 1100110110101010001111111011111011110011001111111100110110101010001111111011101111101000001111110110111001111101110011011010101000111111101111101111001100111111110011011010101000111111101110111110100000111111011011100111101101011110 cdaa3fbef33fcdaa3fbbe83f6e7dcdaa3fbef33fcdaa3fbbe83f6e7b5e
UTF-8 悠렓杖렦悠렓肢렖n}悠렓杖렦悠렓肢렖n{^ 1110011010000010101000001110101110100000100100111110011010011101100101101110101110100000101001101110011010000010101000001110101110100000100100111110100010000010101000101110101110100000100101100110111001111101111001101000001010100000111010111010000010010011111001101001110110010110111010111010000010100110111001101000001010100000111010111010000010010011111010001000001010100010111010111010000010010110011011100111101101011110 e682a0eba093e69d96eba0a6e682a0eba093e882a2eba0966e7de682a0eba093e69d96eba0a6e682a0eba093e882a2eba0966e7b5e
UHC 悠렓杖렦悠렓肢렖n}悠렓杖렦悠렓肢렖n{^ 11101010111011011000111010101000111011011110100010001110101101011110101011101101100011101010100011110010101101101000111010101011011011100111110111101010111011011000111010101000111011011110100010001110101101011110101011101101100011101010100011110010101101101000111010101011011011100111101101011110 eaed8ea8ede88eb5eaed8ea8f2b68eab6e7deaed8ea8ede88eb5eaed8ea8f2b68eab6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)