To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????u}?????????u{^ 0011111100111111001111110011111100111111001111110011111100111111001111110111010101111101001111110011111100111111001111110011111100111111001111110011111100111111011101010111101101011110 3f3f3f3f3f3f3f3f3f757d3f3f3f3f3f3f3f3f3f757b5e
SJIS-WIN 臾????П??Сu}臾????П??Сu{^ 1110010001101011001111110011111100111111001111111000010001010000001111110011111110000100010100100111010101111101111001000110101100111111001111110011111100111111100001000101000000111111001111111000010001010010011101010111101101011110 e46b3f3f3f3f84503f3f8452757de46b3f3f3f3f84503f3f8452757b5e
EUC-JP 臾????П??Сu}臾????П??Сu{^ 1110011111001100001111110011111100111111001111111010011110110001001111110011111110100111101100110111010101111101111001111100110000111111001111110011111100111111101001111011000100111111001111111010011110110011011101010111101101011110 e7cc3f3f3f3fa7b13f3fa7b3757de7cc3f3f3f3fa7b13f3fa7b3757b5e
UTF-8 臾룸쳺紐븍П紐용Сu}臾룸쳺紐븍П紐용Сu{^ 11101000100001111011111011101011101000111011100011101100101100111011101011101111101001111000111111101011101110001000110111010000100111111110111110100111100011111110110010011010101010011101000010100001011101010111110111101000100001111011111011101011101000111011100011101100101100111011101011101111101001111000111111101011101110001000110111010000100111111110111110100111100011111110110010011010101010011101000010100001011101010111101101011110 e887beeba3b8ecb3baefa78febb88dd09fefa78fec9aa9d0a1757de887beeba3b8ecb3baefa78febb88dd09fefa78fec9aa9d0a1757b5e
UHC 臾룸쳺紐븍П紐용Сu}臾룸쳺紐븍П紐용Сu{^ 1110101110101100101101111110101110101011100111011110101110101010101110101110101110101100101100011110101110101010101111111110101110101100101100110111010101111101111010111010110010110111111010111010101110011101111010111010101010111010111010111010110010110001111010111010101010111111111010111010110010110011011101010111101101011110 ebacb7ebab9debaabaebacb1ebaabfebacb3757debacb7ebab9debaabaebacb1ebaabfebacb3757b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)