To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????n}??????????n{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鄙ケム峨★鄙ケム禍・n}鄙ケム峨★鄙ケム禍・n{^ 1110011110111111101110011101000110001001111000111000000110011010111001111011111110111001110100011000100111010000101001010110111001111101111001111011111110111001110100011000100111100011100000011001101011100111101111111011100111010001100010011101000010100101011011100111101101011110 e7bfb9d189e3819ae7bfb9d189d0a56e7de7bfb9d189e3819ae7bfb9d189d0a56e7b5e
EUC-JP 鄙ケム峨★鄙ケム禍・n}鄙ケム峨★鄙ケム禍・n{^ 111011101100000110001110101110011000111011010001101100101110010110100001111110101110111011000001100011101011100110001110110100011011001011010010100011101010010101101110011111011110111011000001100011101011100110001110110100011011001011100101101000011111101011101110110000011000111010111001100011101101000110110010110100101000111010100101011011100111101101011110 eec18eb98ed1b2e5a1faeec18eb98ed1b2d28ea56e7deec18eb98ed1b2e5a1faeec18eb98ed1b2d28ea56e7b5e
UTF-8 鄙ケム峨★鄙ケム禍・n}鄙ケム峨★鄙ケム禍・n{^ 1110100110000100100110011110111110111101101110011110111110111110100100011110010110110011101010001110001010011000100001011110100110000100100110011110111110111101101110011110111110111110100100011110011110100110100011011110111110111101101001010110111001111101111010011000010010011001111011111011110110111001111011111011111010010001111001011011001110101000111000101001100010000101111010011000010010011001111011111011110110111001111011111011111010010001111001111010011010001101111011111011110110100101011011100111101101011110 e98499efbdb9efbe91e5b3a8e29885e98499efbdb9efbe91e7a68defbda56e7de98499efbdb9efbe91e5b3a8e29885e98499efbdb9efbe91e7a68defbda56e7b5e
UHC 鄙??峨★鄙??禍?n}鄙??峨★鄙??禍?n{^ 1101111010101001001111110011111111100100101100011010000111011010110111101010100100111111001111111111110010100001001111110110111001111101110111101010100100111111001111111110010010110001101000011101101011011110101010010011111100111111111111001010000100111111011011100111101101011110 dea93f3fe4b1a1dadea93f3ffca13f6e7ddea93f3fe4b1a1dadea93f3ffca13f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)