What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???沃??蚓??銀ъ????肉ヨぜ筍る? 1110000110011111001111110011111100111111100101111000000000111111001111111110010101101101001111110011111110001011111000101000010010001100001111110011111100111111001111111001001111110111100000111000100010000010101110101110001010100001100000101110100100111111 e19f3f3f3f97803f3fe56d3f3f8be2848c3f3f3f3f93f7838882bae2a182e93f
EUC-JP 癲???沃??蚓??銀ъ????肉ヨぜ筍る? 1110001010100001001111110011111100111111110011011110000000111111001111111110100111001110001111110011111110110110111001001010011111101100001111110011111100111111001111111100011011111001101001011110100010100100101111001110010010100011101001001110101100111111 e2a13f3f3fcde03f3fe9ce3f3fb6e4a7ec3f3f3f3fc6f9a5e8a4bce4a3a4eb3f
UTF-8 癲쑳살뵯沃쇨였蚓곩뼦銀ъ궩嶺뚮뿫肉ヨぜ筍る윿 1110011110011001101100101110110010010001101100111110110010000010101101001110101110110101101011111110011010110010100000111110110010000111101010001110110010011000100000001110100010011010100100111110101010110011101010011110101110111100101001101110100110001010100000001101000110001010111010101011011010101001111011111010011010101011111010111001101010101110111010111011111110101011111010001000001010001001111000111000001110101000111000111000000110011100111001111010110110001101111000111000001010001011111011001001110010111111 e799b2ec91b3ec82b4ebb5afe6b283ec87a8ec9880e89a93eab3a9ebbca6e98a80d18aeab6a9efa6abeb9aaeebbfabe88289e383a8e3819ce7ad8de3828bec9cbf
UHC 癲쑳살뵯沃쇨였蚓곩뼦銀ъ궩嶺뚮뿫肉ヨぜ筍る윿 1110111110100110100111001100111010111011111011001001010010101101111010001010101010111100111010101011111110110100111011001110001010000001111001011001011010101001111010111101111010101100111011001000001010111011111001111010110110001100111010111001011110101011111010111011111110101011111010001010101010111100111000101110110010101010111010111001111110110111 efa69ccebbec94ade8aabceabfb4ece281e596a9ebdeacec82bbe7ad8ceb97abebbfabe8aabce2ecaaeb9fb7

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
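
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table above.

```python
# Hypothetical helper: encode one string in several charsets and
# report the resulting bytes as a binary string and a hex string.
# The codec names are Python's; their mapping to the labels used in
# the table (SJIS-Win, EUC-JP, UHC) is an assumption.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, binary_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("あ"):
        print(name, bits, hexstr)
```

Other tools may choose a different replacement policy (for example, raising an error instead of emitting `?`), so results for unencodable characters can differ from the table.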