To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鼇??嚥??節θ?恂??哀??要ο?俉??^ 11101010100001110011111100111111100110101000101100111111001111111001000011011111100000111100011000111111100111001001011000111111001111111000100010100011001111110011111110010111011101101000001111001101001111111111101001100001001111110011111101011110 ea873f3f9a8b3f3f90df83c63f9c963f3f88a33f3f977683cd3ffa613f3f5e
EUC-JP 鼇??嚥?ı節θ?恂??哀??要ο?俉??^ 11110011111001110011111100111111110100111110101100111111100011111010100111000101110000001110000110100110110010000011111111010111111101100011111100111111101100001010010100111111001111111100110111010111101001101100111100111111100011111011000110111011001111110011111101011110 f3e73f3fd3eb3f8fa9c5c0e1a6c83fd7f63f3fb0a53f3fcdd7a6cf3f8fb1bb3f3f5e
UTF-8 鼇믣맃嚥ㅹı節θ뜵恂욆쮵哀앮엘要ο쉈俉녈깭^ 11101001101111001000011111101011101011111010001111101011101001111000001111100101100110101010010111100011100001011011100111000100101100011110011110101111100000001100111010111000111010111001110010110101111001101000000110000010111011001001101010000110111011001010111010110101111001011001001110000000111011001001010110101110111011001001011110011000111010001010011010000001110011101011111111101100100010011000100011100100101111111000100111101011100001011000100011101010101110011010110101011110 e9bc87ebafa3eba783e59aa5e385b9c4b1e7af80ceb8eb9cb5e68182ec9a86ecaeb5e59380ec95aeec9798e8a681cebfec8988e4bf89eb8588eab9ad5e
UHC 鼇믣맃嚥ㅹı節θ뜵恂욆쮵哀앮엘要ο쉈俉녈깭^ 11101000101010001001001011100101100100001001110111100110101111111010010011101001101010011010010111101111101111011010010111101000100011011011001111100010111000011001111011101000101010001001001011100100111011101001110111100110101111111010010011101001101010011010010111101111101111011010010111100111111010111011001111100011100000111001110001011110 e8a892e5909de6bfa4e9a9a5efbda5e88db3e2e19ee8a892e4ee9de6bfa4e9a9a5efbda5e7ebb3e3839c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)