To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 楷?海?楷?海?n}楷?海?楷?海?n{^ 1001111010110010001111111000101001000011001111111001111010110010001111111000101001000011001111110110111001111101100111101011001000111111100010100100001100111111100111101011001000111111100010100100001100111111011011100111101101011110 9eb23f8a433f9eb23f8a433f6e7d9eb23f8a433f9eb23f8a433f6e7b5e
EUC-JP 楷?海?楷?海?n}楷?海?楷?海?n{^ 1101110010110100001111111011001110100100001111111101110010110100001111111011001110100100001111110110111001111101110111001011010000111111101100111010010000111111110111001011010000111111101100111010010000111111011011100111101101011110 dcb43fb3a43fdcb43fb3a43f6e7ddcb43fb3a43fdcb43fb3a43f6e7b5e
UTF-8 楷렠海렲楷렠海렲n}楷렠海렲楷렠海렲n{^ 1110011010100101101101111110101110100000101000001110011010110101101101111110101110100000101100101110011010100101101101111110101110100000101000001110011010110101101101111110101110100000101100100110111001111101111001101010010110110111111010111010000010100000111001101011010110110111111010111010000010110010111001101010010110110111111010111010000010100000111001101011010110110111111010111010000010110010011011100111101101011110 e6a5b7eba0a0e6b5b7eba0b2e6a5b7eba0a0e6b5b7eba0b26e7de6a5b7eba0a0e6b5b7eba0b2e6a5b7eba0a0e6b5b7eba0b26e7b5e
UHC 楷렠海렲楷렠海렲n}楷렠海렲楷렠海렲n{^ 11111010101011001000111010110001111110101010110110001110101111111111101010101100100011101011000111111010101011011000111010111111011011100111110111111010101011001000111010110001111110101010110110001110101111111111101010101100100011101011000111111010101011011000111010111111011011100111101101011110 faac8eb1faad8ebffaac8eb1faad8ebf6e7dfaac8eb1faad8ebffaac8eb1faad8ebf6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)