What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 嚥≪?誼?┸袁⑥????嚥≪?誼?┸袁⑥????E 10011010100010111000000111100001001111111000101101100010001111111000010010111101111001011100110110000111010001010011111100111111001111110011111110011010100010111000000111100001001111111000101101100010001111111000010010111101111001011100110110000111010001010011111100111111001111110011111101000101 9a8b81e13f8b623f84bde5cd87453f3f3f3f9a8b81e13f8b623f84bde5cd87453f3f3f3f45
EUC-JP 嚥≪?誼?┸袁?????嚥≪?誼?┸袁?????E 1101001111101011101000101110001100111111101101011100001100111111101010001011111111101010110011110011111100111111001111110011111100111111110100111110101110100010111000110011111110110101110000110011111110101000101111111110101011001111001111110011111100111111001111110011111101000101 d3eba2e33fb5c33fa8bfeacf3f3f3f3f3fd3eba2e33fb5c33fa8bfeacf3f3f3f3f3f45
UTF-8 嚥≪룆誼띰┸袁⑥젆輦깊뤆嚥≪룆誼띰┸袁⑥젆輦깊뤉E 11100101100110101010010111100010100010011010101011101011101000111000011011101000101010101011110011101011100111011011000011100010100101001011100011101000101000101000000111100010100100011010010111101100101000001000011011101111101001101001100011101010101110011000101011101011101001001000011011100101100110101010010111100010100010011010101011101011101000111000011011101000101010101011110011101011100111011011000011100010100101001011100011101000101000101000000111100010100100011010010111101100101000001000011011101111101001101001100011101010101110011000101011101011101001001000100101000101 e59aa5e289aaeba386e8aabceb9db0e294b8e8a281e291a5eca086efa698eab98aeba486e59aa5e289aaeba386e8aabceb9db0e294b8e8a281e291a5eca086efa698eab98aeba48945
UHC 嚥≪룆誼띰┸袁⑥젆輦깊뤆嚥≪룆誼띰┸袁⑥젆輦깊뤉E 11100110101111111010000111101100100011111000010111101011111111101011011011101111101001101011111111101010101111101010100011101100101000001000100111100110111001001011000111101101100011111011011011100110101111111010000111101100100011111000010111101011111111101011011011101111101001101011111111101010101111101010100011101100101000001000100111100110111001001011000111101101100011111011100101000101 e6bfa1ec8f85ebfeb6efa6bfeabea8eca089e6e4b1ed8fb6e6bfa1ec8f85ebfeb6efa6bfeabea8eca089e6e4b1ed8fb945

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
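
The conversion shown in the table can be approximated with a short Python sketch. The page does not say how it is implemented, so everything here is an assumption: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is a best guess that may differ from the tool's own converter for some characters. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to match the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above equates SJIS-Win with CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC taken to be Windows code page 949
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in the table."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("E").items():
        print(label, binary, hexadecimal)
```

For the single character `E`, every charset in the table yields the same byte, 0x45, which is the trailing `45` visible at the end of each hexadecimal column above.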