To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN ???嚥?R???嚥?^[???嚥?R???嚥?^[^ 00111111001111110011111110011010100010110011111101010010001111110011111100111111100110101000101100111111010111100101101100111111001111110011111110011010100010110011111101010010001111110011111100111111100110101000101100111111010111100101101101011110 3f3f3f9a8b3f523f3f3f9a8b3f5e5b3f3f3f9a8b3f523f3f3f9a8b3f5e5b5e
EUC-JP ???嚥?R???嚥?^[???嚥?R???嚥?^[^ 00111111001111110011111111010011111010110011111101010010001111110011111100111111110100111110101100111111010111100101101100111111001111110011111111010011111010110011111101010010001111110011111100111111110100111110101100111111010111100101101101011110 3f3f3fd3eb3f523f3f3fd3eb3f5e5b3f3f3fd3eb3f523f3f3fd3eb3f5e5b5e
UTF-8 若등뇴嚥쁭R若등뇴嚥쁭^[若등뇴嚥쁭R若등뇴嚥쁭^[^ 11101111101001011011010011101011100100111011000111101011100001111011010011100101100110101010010111101100100000011010110101010010111011111010010110110100111010111001001110110001111010111000011110110100111001011001101010100101111011001000000110101101010111100101101111101111101001011011010011101011100100111011000111101011100001111011010011100101100110101010010111101100100000011010110101010010111011111010010110110100111010111001001110110001111010111000011110110100111001011001101010100101111011001000000110101101010111100101101101011110 efa5b4eb93b1eb87b4e59aa5ec81ad52efa5b4eb93b1eb87b4e59aa5ec81ad5e5befa5b4eb93b1eb87b4e59aa5ec81ad52efa5b4eb93b1eb87b4e59aa5ec81ad5e5b5e
UHC 若등뇴嚥쁭R若등뇴嚥쁭^[若등뇴嚥쁭R若등뇴嚥쁭^[^ 1110010110101110101101011110111010000111100110001110011010111111100110000110111001010010111001011010111010110101111011101000011110011000111001101011111110011000011011100101111001011011111001011010111010110101111011101000011110011000111001101011111110011000011011100101001011100101101011101011010111101110100001111001100011100110101111111001100001101110010111100101101101011110 e5aeb5ee8798e6bf986e52e5aeb5ee8798e6bf986e5e5be5aeb5ee8798e6bf986e52e5aeb5ee8798e6bf986e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)