To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 哀??筌??蹂????????音??厭??B 100010001010001100111111001111111110001010100011001111110011111111100110111110000011111100111111001111110011111100111111001111110011111100111111100010011011100100111111001111111000100101111101001111110011111101000010 88a33f3fe2a33f3fe6f83f3f3f3f3f3f3f3f89b93f3f897d3f3f42
EUC-JP 哀??筌??蹂????????音??厭??B 101100001010010100111111001111111110010010100101001111110011111111101100111110100011111100111111001111110011111100111111001111110011111100111111101100101011101100111111001111111011000111011110001111110011111101000010 b0a53f3fe4a53f3fecfa3f3f3f3f3f3f3f3fb2bb3f3fb1de3f3f42
UTF-8 哀잙젦筌좎뇴蹂뺟뼇溜묈맅溜욑쭬音섎퀡厭묐젒B 11100101100100111000000011101100100111101001100111101100101000001010011011100111101011011000110011101100101000101000111011101011100001111011010011101000101110011000001011101011101110101001111111101011101111001000011111101111101001111000101111101011101011001000100011101011101001111000010111101111101001111000101111101100100110101001000111101100101011011010110011101001100111111011001111101100100001001000111011101101100000001010000111100101100011101010110111101011101011001001000011101100101000001001001001000010 e59380ec9e99eca0a6e7ad8ceca28eeb87b4e8b982ebba9febbc87efa78bebac88eba785efa78bec9a91ecadace99fb3ec848eed80a1e58eadebac90eca09242
UHC 哀잙젦筌좎뇴蹂뺟뼇溜묈맅溜욑쭬音섎퀡厭묐젒B 11100100111011101001111111101011101000001001111011101111101001111010000011101100100001111001100011101011101100111001010111100111100101101001000111101010111111101001000111100101100100001001111111101010111111101001111011101111101001111010000011101011111001011001100011101011101100111001010111100110111101001001000111101011101000001001000101000010 e4ee9feba09eefa7a0ec8798ebb395e79691eafe91e5909feafe9eefa7a0ebe598ebb395e6f491eba09142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)