To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 厭ャ?壹??楡④?n}厭ャ?壹??楡④?n{^ 100010010111110110000011100000110011111110011010111000110011111100111111100111101011111010000111010000110011111101101110011111011000100101111101100000111000001100111111100110101110001100111111001111111001111010111110100001110100001100111111011011100111101101011110 897d83833f9ae33f3f9ebe87433f6e7d897d83833f9ae33f3f9ebe87433f6e7b5e
EUC-JP 厭ャ?壹??楡??n}厭ャ?壹??楡??n{^ 10110001110111101010010111100011001111111101010011100101001111110011111111011100110000000011111100111111011011100111110110110001110111101010010111100011001111111101010011100101001111110011111111011100110000000011111100111111011011100111101101011110 b1dea5e33fd4e53f3fdcc03f3f6e7db1dea5e33fd4e53f3fdcc03f3f6e7b5e
UTF-8 厭ャ깿壹븝쫿楡④퉹n}厭ャ깿壹븝쫿楡④퉹n{^ 1110010110001110101011011110001110000011101000111110101010111001101111111110010110100011101110011110101110111000100111011110110010101011101111111110011010100101101000011110001010010001101000111110110110001001101110010110111001111101111001011000111010101101111000111000001110100011111010101011100110111111111001011010001110111001111010111011100010011101111011001010101110111111111001101010010110100001111000101001000110100011111011011000100110111001011011100111101101011110 e58eade383a3eab9bfe5a3b9ebb89decabbfe6a5a1e291a3ed89b96e7de58eade383a3eab9bfe5a3b9ebb89decabbfe6a5a1e291a3ed89b96e7b5e
UHC 厭ャ깿壹븝쫿楡④퉹n}厭ャ깿壹븝쫿楡④퉹n{^ 1110011011110100101010111110001110000011101010001110110011101100101110101110111110100110100101101110101011111000101010001110101010111001100100010110111001111101111001101111010010101011111000111000001110101000111011001110110010111010111011111010011010010110111010101111100010101000111010101011100110010001011011100111101101011110 e6f4abe383a8ececbaefa696eaf8a8eab9916e7de6f4abe383a8ececbaefa696eaf8a8eab9916e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)