What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???猥る?鴦??}???猥る?鴦??{^ 001111110011111100111111111000001100111010000010111010010011111111101001111100010011111100111111011111010011111100111111001111111110000011001110100000101110100100111111111010011111000100111111001111110111101101011110 3f3f3fe0ce82e93fe9f13f3f7d3f3f3fe0ce82e93fe9f13f3f7b5e
EUC-JP ???猥る?鴦??}???猥る?鴦??{^ 001111110011111100111111111000001101000010100100111010110011111111110010111100110011111100111111011111010011111100111111001111111110000011010000101001001110101100111111111100101111001100111111001111110111101101011110 3f3f3fe0d0a4eb3ff2f33f3f7d3f3f3fe0d0a4eb3ff2f33f3f7b5e
UTF-8 惡욌젣猥る젪鴦잙젛}惡욌젣猥る젪鴦잙젛{^ 111011111010011010111001111011001001101010001100111011001010000010100011111001111000110010100101111000111000001010001011111011001010000010101010111010011011010010100110111011001001111010011001111011001010000010011011011111011110111110100110101110011110110010011010100011001110110010100000101000111110011110001100101001011110001110000010100010111110110010100000101010101110100110110100101001101110110010011110100110011110110010100000100110110111101101011110 efa6b9ec9a8ceca0a3e78ca5e3828beca0aae9b4a6ec9e99eca09b7defa6b9ec9a8ceca0a3e78ca5e3828beca0aae9b4a6ec9e99eca09b7b5e
UHC 惡욌젣猥る젪鴦잙젛}惡욌젣猥る젪鴦잙젛{^ 111001111111011110011110111010111010000010011100111010001110010110101010111010111010000010100010111001001110110010011111111010111010000010010111011111011110011111110111100111101110101110100000100111001110100011100101101010101110101110100000101000101110010011101100100111111110101110100000100101110111101101011110 e7f79eeba09ce8e5aaeba0a2e4ec9feba0977de7f79eeba09ce8e5aaeba0a2e4ec9feba0977b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
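
The conversion shown in the table can be approximated with a minimal Python sketch. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932, UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Python's codecs may differ from the tool's own converter in edge cases. Characters that a charset cannot represent are replaced with "?" (3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bytes.
    for label, (bits, hexa) in encode_table("猥る}").items():
        print(label, bits, hexa)
```

For instance, encode_row("猥る", "cp932") gives the hex string e0ce82e9 and encode_row("猥る", "euc_jp") gives e0d0a4eb, the same byte pairs that appear in the SJIS-WIN and EUC-JP rows.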