To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 哀??冗????お液?????轅???ら?^ 10001000101000110011111100111111100011111110011100111111001111110011111100111111100000101010100010001001011101000011111100111111001111110011111100111111111001110111011000111111001111110011111110000010111001110011111101011110 88a33f3f8fe73f3f3f3f82a889743f3f3f3f3fe7763f3f3f82e73f5e
EUC-JP 哀??冗????お液?????轅???ら?^ 10110000101001010011111100111111101111101110100100111111001111110011111100111111101001001010101010110001110101010011111100111111001111110011111100111111111011011101011100111111001111110011111110100100111010010011111101011110 b0a53f3fbee93f3f3f3fa4aab1d53f3f3f3f3fedd73f3f3fa4e93f5e
UTF-8 哀잙젦冗밴옇淋끿お液ㅵ맅溜욇레轅명쉫囹ら턄^ 11100101100100111000000011101100100111101001100111101100101000001010011011100101100001101001011111101011101100001011010011101100100110001000011111101111101001111011010111101011100000011011111111100011100000011000101011100110101101101011001011100011100001011011010111101011101001111000010111101111101001111000101111101100100110101000011111101011101000001000100011101000101111011000010111101011101010101000010111101100100010011010101111101111101001101010100111100011100000101000100111101101100001001000010001011110 e59380ec9e99eca0a6e58697ebb0b4ec9887efa7b5eb81bfe3818ae6b6b2e385b5eba785efa78bec9a87eba088e8bd85ebaa85ec89abefa6a9e38289ed84845e
UHC 哀잙젦冗밴옇淋끿お液ㅵ맅溜욇레轅명쉫囹ら턄^ 11100100111011101001111111101011101000001001111011101001101101111011100111101010101111111011100011101100111110001000010111100111101010101010101011100100111110111010010011100101100100001001111111101010111111101001111011101001101101111011100111101010101111111011100011101101100110101000010111100111101010101010101011101001101101011010000001011110 e4ee9feba09ee9b7b9eabfb8ecf885e7aaaae4fba4e5909feafe9ee9b7b9eabfb8ed9a85e7aaaae9b5a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)