To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄????源??榮??日??淞る?五??踰 1001011011101111001111110011111100111111001111111000110010111001001111110011111110011110110001000011111100111111100100111111101000111111001111111001111111000010100000101110100100111111100011001101110000111111001111111110011011111010 96ef3f3f3f3f8cb93f3f9ec43f3f93fa3f3f9fc282e93f8cdc3f3fe6fa
EUC-JP 厄????源??榮??日??淞る?五??踰 1100110011110001001111110011111100111111001111111011100010111011001111110011111111011100110001100011111100111111110001101111110000111111001111111101111011000100101001001110101100111111101110001101111000111111001111111110110011111100 ccf13f3f3f3fb8bb3f3fdcc63f3fc6fc3f3fdec4a4eb3fb8de3f3fecfc
UTF-8 厄닌듑쀦틦源띿춷榮붽퍓日곩슖淞る듌五묐틷踰 111001011000111010000100111010111000101110001100111010111001001110010001111011001000000010100110111011011000101110100110111001101011101010010000111010111001110110111111111011001011011010110111111001101010011010101110111010111011011010111101111011011000110110010011111001101001011110100101111010101011001110101001111011001000101010010110111001101011011110011110111000111000001010001011111010111001001110001100111001001011101010010100111010111010110010010000111011011000101110110111111010001011100010110000 e58e84eb8b8ceb9391ec80a6ed8ba6e6ba90eb9dbfecb6b7e6a6aeebb6bded8d93e697a5eab3a9ec8a96e6b79ee3828beb938ce4ba94ebac90ed8bb7e8b8b0
UHC 厄닌듑쀦틦源띿춷榮붽퍓日곩슖淞る듌五묐틷踰 111001001111100010110100110100011000101011000011100101111110011010111010100100001110101010111001100011011110110010101101100100111110011110110100100101001110101010111011100010101110110011101101100000011110010110011010101001011110000111100111101010101110101110001010101111111110011111101001100100011110101110111010100111101110101110110010 e4f8b4d18ac397e6ba90eab98decad93e7b494eabb8aeced81e59aa5e1e7aaeb8abfe7e991ebba9eebb2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)