To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 野????淫??夭?)肉η?碎??癌??裕? 10010110111011000011111100111111001111110011111110001000111110100011111100111111100110101110111000111111100000010110101010010011111101111000001111000101001111111110000111101010001111110011111110001010111000000011111100111111100101110101010000111111 96ec3f3f3f3f88fa3f3f9aee3f816a93f783c53fe1ea3f3f8ae03f3f97543f
EUC-JP 野????淫??夭?)肉η?碎??癌??裕? 11001100111011100011111100111111001111110011111110110000111111000011111100111111110101001111000000111111101000011100101111000110111110011010011011000111001111111110001011101100001111110011111110110100111000100011111100111111110011011011010100111111 ccee3f3f3f3fb0fc3f3fd4f03fa1cbc6f9a6c73fe2ec3f3fb4e23f3fcdb53f
UTF-8 野ㅞ뼛욥뇡淫됰뤉夭곕)肉η솒碎ㅽ닡癌껋눛裕켇 1110100110000111100011101110001110000101100111101110101110111100100110111110110010011010101001011110101110000111101000011110011010110111101010111110101110010000101100001110101110100100100010011110010110100100101011011110101010110011100101011110111110111100100010011110100010000010100010011100111010110111111011001000011010010010111001111010001010001110111000111000010110111101111010111000101110100001111001111001100110001100111010101011101110001011111010111000100010011011111010001010001110010101111011001011110010000111 e9878ee3859eebbc9bec9aa5eb87a1e6b7abeb90b0eba489e5a4adeab395efbc89e88289ceb7ec8692e7a28ee385bdeb8ba1e7998ceabb8beb889be8a395ecbc87
UHC 野ㅞ뼛욥뇡淫됰뤉夭곕)肉η솒碎ㅽ닡癌껋눛裕켇 1110010110101111101001001100111010111011110001001011111111101001100001111000100111101011111000101000100111101011100011111011100111101000111011001011000011101011101000111010100111101011101111111010010111100111100110011001001011100001111011111010010011101101100010001010000111100100110111111000001111101100100001111011001111101011101011101011000101000101 e5afa4cebbc4bfe98789ebe289eb8fb9e8ecb0eba3a9ebbfa5e79992e1efa4ed88a1e4df83ec87b3ebaeb145

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)