To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ????遭????遭[????遭????遭[^ 001111110011111100111111001111111001000110011000001111110011111100111111001111111001000110011000010110110011111100111111001111110011111110010001100110000011111100111111001111110011111110010001100110000101101101011110 3f3f3f3f91983f3f3f3f91985b3f3f3f3f91983f3f3f3f91985b5e
EUC-JP ????遭????遭[????遭????遭[^ 001111110011111100111111001111111100000111111000001111110011111100111111001111111100000111111000010110110011111100111111001111110011111111000001111110000011111100111111001111110011111111000001111110000101101101011110 3f3f3f3fc1f83f3f3f3fc1f85b3f3f3f3fc1f83f3f3f3fc1f85b5e
UTF-8 센솖센섹遭센솖센섹遭[센솖센섹遭센솖센섹遭[^ 111011001000010010111100111011001000011010010110111011001000010010111100111011001000010010111001111010011000000110101101111011001000010010111100111011001000011010010110111011001000010010111100111011001000010010111001111010011000000110101101010110111110110010000100101111001110110010000110100101101110110010000100101111001110110010000100101110011110100110000001101011011110110010000100101111001110110010000110100101101110110010000100101111001110110010000100101110011110100110000001101011010101101101011110 ec84bcec8696ec84bcec84b9e981adec84bcec8696ec84bcec84b9e981ad5bec84bcec8696ec84bcec84b9e981adec84bcec8696ec84bcec84b9e981ad5b5e
UHC 센솖센섹遭센솖센섹遭[센솖센섹遭센솖센섹遭[^ 10111100101111101011110011010111101111001011111010111100101111011111000011100100101111001011111010111100110101111011110010111110101111001011110111110000111001000101101110111100101111101011110011010111101111001011111010111100101111011111000011100100101111001011111010111100110101111011110010111110101111001011110111110000111001000101101101011110 bcbebcd7bcbebcbdf0e4bcbebcd7bcbebcbdf0e45bbcbebcd7bcbebcbdf0e4bcbebcd7bcbebcbdf0e45b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)