To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????º????????º^ 00111111001111110011111100111111001111110011111100111111001111111011101000111111001111110011111100111111001111110011111100111111001111111011101001011110 3f3f3f3f3f3f3f3fba3f3f3f3f3f3f3f3fba5e
SJIS-WIN 癲l?泣?ぜ擬??癲l?泣?ぜ擬??^ 1110000110011111100000101000110000111111100010111000001100111111100000101011101010001011010110110011111100111111111000011001111110000010100011000011111110001011100000110011111110000010101110101000101101011011001111110011111101011110 e19f828c3f8b833f82ba8b5b3f3fe19f828c3f8b833f82ba8b5b3f3f5e
EUC-JP 癲l?泣?ぜ擬?º癲l?泣?ぜ擬?º^ 111000101010000110100011111011000011111110110101111000110011111110100100101111001011010110111100001111111000111110100010111010111110001010100001101000111110110000111111101101011110001100111111101001001011110010110101101111000011111110001111101000101110101101011110 e2a1a3ec3fb5e33fa4bcb5bc3f8fa2ebe2a1a3ec3fb5e33fa4bcb5bc3f8fa2eb5e
UTF-8 癲l옕泣뽬ぜ擬듭º癲l옕泣뽬ぜ擬듭º^ 1110011110011001101100101110111110111101100011001110110010011000100101011110011010110011101000111110101110111101101011001110001110000001100111001110011010010011101011001110101110010011101011011100001010111010111001111001100110110010111011111011110110001100111011001001100010010101111001101011001110100011111010111011110110101100111000111000000110011100111001101001001110101100111010111001001110101101110000101011101001011110 e799b2efbd8cec9895e6b3a3ebbdace3819ce693aceb93adc2bae799b2efbd8cec9895e6b3a3ebbdace3819ce693aceb93adc2ba5e
UHC 癲l옕泣뽬ぜ擬듭º癲l옕泣뽬ぜ擬듭º^ 11101111101001101010001111101100100111101001101111101011111010001001011011101000101010101011110011101011111101001011010111101100101010001010110011101111101001101010001111101100100111101001101111101011111010001001011011101000101010101011110011101011111101001011010111101100101010001010110001011110 efa6a3ec9e9bebe896e8aabcebf4b5eca8acefa6a3ec9e9bebe896e8aabcebf4b5eca8ac5e
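The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp`, `cp949` for UHC) are assumed equivalents in Python's standard library, and the helper name `encode_all` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the charset labels used on this page to
# Python standard-library codec names (an approximation).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("ぜ").items():
        print(label, bits, hexstr)
```

For the single character ぜ, the hexadecimal values produced for SJIS-WIN, EUC-JP and UTF-8 (82ba, a4bc and e3819c) can be found inside the corresponding rows of the table above.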

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).