To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 關ャ鬧育オり成鬧郁註關ャ鬧育オり成鬧郁註^ 11101000100100001010110011101001101001111000100011100111101101011000001011101000100100001010110011101001101001111000100011101000100100101001000011101000100100001010110011101001101001111000100011100111101101011000001011101000100100001010110011101001101001111000100011101000100100101001000001011110 e890ace9a788e7b582e890ace9a788e89290e890ace9a788e7b582e890ace9a788e892905e
EUC-JP 關ャ鬧育オり成鬧郁註關ャ鬧育オり成鬧郁註^ 1110111111110000100011101010110011110010101010011011000011101001100011101011010110100100111010101100000010101110111100101010100110110000111010101100001111110000111011111111000010001110101011001111001010101001101100001110100110001110101101011010010011101010110000001010111011110010101010011011000011101010110000111111000001011110 eff08eacf2a9b0e98eb5a4eac0aef2a9b0eac3f0eff08eacf2a9b0e98eb5a4eac0aef2a9b0eac3f05e
UTF-8 關ャ鬧育オり成鬧郁註關ャ鬧育オり成鬧郁註^ 11101001100101111001110011101111101111011010110011101001101011001010011111101000100000101011001011101111101111011011010111100011100000101000101011100110100010001001000011101001101011001010011111101001100000111000000111101000101010001011101111101001100101111001110011101111101111011010110011101001101011001010011111101000100000101011001011101111101111011011010111100011100000101000101011100110100010001001000011101001101011001010011111101001100000111000000111101000101010001011101101011110 e9979cefbdace9aca7e882b2efbdb5e3828ae68890e9aca7e98381e8a8bbe9979cefbdace9aca7e882b2efbdb5e3828ae68890e9aca7e98381e8a8bb5e
UHC 關?鬧育?り成鬧郁註關?鬧育?り成鬧郁註^ 11001110101111000011111111010111101000101110101111000000001111111010101011101010111000001111011111010111101000101110100111110100111100011100100111001110101111000011111111010111101000101110101111000000001111111010101011101010111000001111011111010111101000101110100111110100111100011100100101011110 cebc3fd7a2ebc03faaeae0f7d7a2e9f4f1c9cebc3fd7a2ebc03faaeae0f7d7a2e9f4f1c95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)