What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 瓦??壓??秧??n}瓦??壓??秧??n{^ 1000101010100010001111110011111110011010110110000011111100111111111000100101111000111111001111110110111001111101100010101010001000111111001111111001101011011000001111110011111111100010010111100011111100111111011011100111101101011110 8aa23f3f9ad83f3fe25e3f3f6e7d8aa23f3f9ad83f3fe25e3f3f6e7b5e
EUC-JP 瓦??壓??秧??n}瓦??壓??秧??n{^ 1011010010100100001111110011111111010100110110100011111100111111111000111011111100111111001111110110111001111101101101001010010000111111001111111101010011011010001111110011111111100011101111110011111100111111011011100111101101011110 b4a43f3fd4da3f3fe3bf3f3f6e7db4a43f3fd4da3f3fe3bf3f3f6e7b5e
UTF-8 瓦븀떁壓잒쓿秧껇즳n}瓦븀떁壓잒쓿秧껇즳n{^ 1110011110010011101001101110101110111000100000001110101110010110100000011110010110100011100100111110110010011110100100101110110010010011101111111110011110100111101001111110101010111011100001111110110010100110101100110110111001111101111001111001001110100110111010111011100010000000111010111001011010000001111001011010001110010011111011001001111010010010111011001001001110111111111001111010011110100111111010101011101110000111111011001010011010110011011011100111101101011110 e793a6ebb880eb9681e5a393ec9e92ec93bfe7a7a7eabb87eca6b36e7de793a6ebb880eb9681e5a393ec9e92ec93bfe7a7a7eabb87eca6b36e7b5e
UHC 瓦븀떁壓잒쓿秧껇즳n}瓦븀떁壓잒쓿秧껇즳n{^ 1110100010111111101110101110011110001011100101111110010011100010100111111110100010111110101101111110010011101011100000111110100010100011100001010110111001111101111010001011111110111010111001111000101110010111111001001110001010011111111010001011111010110111111001001110101110000011111010001010001110000101011011100111101101011110 e8bfbae78b97e4e29fe8beb7e4eb83e8a3856e7de8bfbae78b97e4e29fe8beb7e4eb83e8a3856e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
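
The same kind of conversion can be sketched offline in Python. This is only an illustration, not the code behind this page: the names CHARSETS, encode_row, and convert are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-Win to cp932, UHC to cp949) is an assumption. Characters a charset cannot represent are replaced with "?" (3f), as in the table above; results may still differ from this page for some inputs.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: SJIS-Win corresponds to code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("瓦n}").items():
        print(name, bits, hex_string)
```

Calling convert with a single character returns one (binary, hexadecimal) pair per charset, so the rows can be compared side by side in the same way as the table.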