To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾?????衣??乙???c?肉η?碎κ? 1110010010001000001111110011111100111111001111110011111110001000110111110011111100111111100010011011001100111111001111110011111110000010100000110011111110010011111101111000001111000101001111111110000111101010100000111100100000111111 e4883f3f3f3f3f88df3f3f89b33f3f3f82833f93f783c53fe1ea83c83f
EUC-JP 艾?????衣??乙???c?肉η?碎κ? 1110011111101000001111110011111100111111001111110011111110110000111000010011111100111111101100101011010100111111001111110011111110100011111000110011111111000110111110011010011011000111001111111110001011101100101001101100101000111111 e7e83f3f3f3f3fb0e13f3fb2b53f3f3fa3e33fc6f9a6c73fe2eca6ca3f
UTF-8 艾싳궠梨욘룚衣껃뿗乙녹굻力c끂肉η솒碎κ퐥 11101000100010011011111011101100100010111011001111101010101101101010000011101111101001111010001011101100100110101001100011101011101000111001101011101000101000011010001111101010101110111000001111101011101111111001011111100100101110011001100111101011100001011011100111101010101101011011101111101111101001101000101011101111101111011000001111101011100000011000001011101000100000101000100111001110101101111110110010000110100100101110011110100010100011101100111010111010111011011001000010100101 e889beec8bb3eab6a0efa7a2ec9a98eba39ae8a1a3eabb83ebbf97e4b999eb85b9eab5bbefa68aefbd83eb8182e88289ceb7ec8692e7a28ecebaed90a5
UHC 艾싳궠梨욘룚衣껃뿗乙녹굻力c끂肉η솒碎κ퐥 111001001111010110011010111011001000001010110011111011001011000110111111111001101000111110010110111010111111110110000011111001011001011110011010111010111110000010110011111011001011000110111111111001101011001110100011111000111000010110111000111010111011111110100101111001111001100110010010111000011110111110100101111010101011110110001110 e4f59aec82b3ecb1bfe68f96ebfd83e5979aebe0b3ecb1bfe6b3a3e385b8ebbfa5e79992e1efa5eabd8e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)