To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????T 00111111001111110011111100111111001111110011111100111111001111110011111101010100 3f3f3f3f3f3f3f3f3f54
SJIS-WIN 茫e梱郤わ醜羝э什T 11100100101010011000001010000101100011011010101111100111101110101000001011101101100011110101100011100011101101101000010010001111100011110101100101010100 e4a982858dabe7ba82ed8f58e3b6848f8f5954
EUC-JP 茫e梱郤わ醜羝э什T 11101000101010111010001111100101101110101010110111101110101111001010010011101111101111011011100111100110101110001010011111101111101111011011101001010100 e8aba3e5baadeebca4efbdb9e6b8a7efbdba54
UTF-8 茫e梱郤わ醜羝э什T 111010001000110010101011111011111011110110000101111001101010001010110001111010011000001110100100111000111000001010001111111010011000011010011100111001111011111010011101110100011000110111100100101110111000000001010100 e88cabefbd85e6a2b1e983a4e3828fe9869ce7be9dd18de4bb8054
UHC 茫e梱?わ醜?э什T 1101100011010100101000111110010111001101111000010011111110101010111011111111010111011101001111111010110011101111111001001010011101010100 d8d4a3e5cde13faaeff5dd3facefe4a754

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)