To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 嚥????R嚥????^[嚥????R嚥????^[^ 10011010100010110011111100111111001111110011111101010010100110101000101100111111001111110011111100111111010111100101101110011010100010110011111100111111001111110011111101010010100110101000101100111111001111110011111100111111010111100101101101011110 9a8b3f3f3f3f529a8b3f3f3f3f5e5b9a8b3f3f3f3f529a8b3f3f3f3f5e5b5e
EUC-JP 嚥????R嚥????^[嚥????R嚥????^[^ 11010011111010110011111100111111001111110011111101010010110100111110101100111111001111110011111100111111010111100101101111010011111010110011111100111111001111110011111101010010110100111110101100111111001111110011111100111111010111100101101101011110 d3eb3f3f3f3f52d3eb3f3f3f3f5e5bd3eb3f3f3f3f52d3eb3f3f3f3f5e5b5e
UTF-8 嚥싲틹鱗퐊R嚥싲틹鱗퐊^[嚥싲틹鱗퐊R嚥싲틹鱗퐊^[^ 11100101100110101010010111101100100010111011001011101101100010111011100111101111101001111011001011101101100100001000101001010010111001011001101010100101111011001000101110110010111011011000101110111001111011111010011110110010111011011001000010001010010111100101101111100101100110101010010111101100100010111011001011101101100010111011100111101111101001111011001011101101100100001000101001010010111001011001101010100101111011001000101110110010111011011000101110111001111011111010011110110010111011011001000010001010010111100101101101011110 e59aa5ec8bb2ed8bb9efa7b2ed908a52e59aa5ec8bb2ed8bb9efa7b2ed908a5e5be59aa5ec8bb2ed8bb9efa7b2ed908a52e59aa5ec8bb2ed8bb9efa7b2ed908a5e5b5e
UHC 嚥싲틹鱗퐊R嚥싲틹鱗퐊^[嚥싲틹鱗퐊R嚥싲틹鱗퐊^[^ 1110011010111111100110101110101110111010100111111110110011100111101111010110111001010010111001101011111110011010111010111011101010011111111011001110011110111101011011100101111001011011111001101011111110011010111010111011101010011111111011001110011110111101011011100101001011100110101111111001101011101011101110101001111111101100111001111011110101101110010111100101101101011110 e6bf9aebba9fece7bd6e52e6bf9aebba9fece7bd6e5e5be6bf9aebba9fece7bd6e52e6bf9aebba9fece7bd6e5e5b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
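
For readers who want to reproduce a table like the one above offline, here is a minimal Python sketch. It is not this page's actual implementation. The mapping from the charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) is an assumption, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset; returns {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Hiragana "a" (U+3042), written as an escape to keep the source ASCII.
    for label, (bits, hexa) in convert("\u3042").items():
        print(label, bits, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.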