What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???g??????v???g??????vB 0011111100111111001111110110011100111111001111110011111100111111001111110011111101110110001111110011111100111111011001110011111100111111001111110011111100111111001111110111011001000010 3f3f3f673f3f3f3f3f3f763f3f3f673f3f3f3f3f3f7642
SJIS-WIN 箋醐襲g箋醐酬箋醐愁v箋醐襲g箋醐酬箋醐愁vB 1110001010110011100011001110110110001111010100000110011111100010101100111000110011101101100011110101011011100010101100111000110011101101100011110100010001110110111000101011001110001100111011011000111101010000011001111110001010110011100011001110110110001111010101101110001010110011100011001110110110001111010001000111011001000010 e2b38ced8f5067e2b38ced8f56e2b38ced8f4476e2b38ced8f5067e2b38ced8f56e2b38ced8f447642
EUC-JP 箋醐襲g箋醐酬箋醐愁v箋醐襲g箋醐酬箋醐愁vB 1110010010110101101110001110111110111101101100010110011111100100101101011011100011101111101111011011011111100100101101011011100011101111101111011010010101110110111001001011010110111000111011111011110110110001011001111110010010110101101110001110111110111101101101111110010010110101101110001110111110111101101001010111011001000010 e4b5b8efbdb167e4b5b8efbdb7e4b5b8efbda576e4b5b8efbdb167e4b5b8efbdb7e4b5b8efbda57642
UTF-8 箋醐襲g箋醐酬箋醐愁v箋醐襲g箋醐酬箋醐愁vB 1110011110101110100010111110100110000110100100001110100010100101101100100110011111100111101011101000101111101001100001101001000011101001100001011010110011100111101011101000101111101001100001101001000011100110100001001000000101110110111001111010111010001011111010011000011010010000111010001010010110110010011001111110011110101110100010111110100110000110100100001110100110000101101011001110011110101110100010111110100110000110100100001110011010000100100000010111011001000010 e7ae8be98690e8a5b267e7ae8be98690e985ace7ae8be98690e6848176e7ae8be98690e8a5b267e7ae8be98690e985ace7ae8be98690e684817642
UHC 箋?襲g箋?酬箋?愁v箋?襲g箋?酬箋?愁vB 1110111110101000001111111110001110101001011001111110111110101000001111111110001011000110111011111010100000111111111000011111111001110110111011111010100000111111111000111010100101100111111011111010100000111111111000101100011011101111101010000011111111100001111111100111011001000010 efa83fe3a967efa83fe2c6efa83fe1fe76efa83fe3a967efa83fe2c6efa83fe1fe7642

SJIS-win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
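
A conversion like the one in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# Python's codecs may differ in detail from the converter used on this page.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal one.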