To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 證?i爾臀濾億??證?i爾臀濾億??^ 11100110100110100011111110000010100010011000111010100010111001000101110011100000011010001000100110101101001111110011111111100110100110100011111110000010100010011000111010100010111001000101110011100000011010001000100110101101001111110011111101011110 e69a3f82898ea2e45ce06889ad3f3fe69a3f82898ea2e45ce06889ad3f3f5e
EUC-JP 證?i爾臀濾億??證?i爾臀濾億??^ 11101011111110100011111110100011111010011011110010100100111001111011110111011111110010011011001010101111001111110011111111101011111110100011111110100011111010011011110010100100111001111011110111011111110010011011001010101111001111110011111101011110 ebfa3fa3e9bca4e7bddfc9b2af3f3febfa3fa3e9bca4e7bddfc9b2af3f3f5e
UTF-8 證뜹i爾臀濾億꿴땡證뜹i爾臀濾億꿴땡^ 11101000101011011000100111101011100111001011100111101111101111011000100111100111100010001011111011101000100001111000000011100110101111111011111011100101100001001000010011101010101111111011010011101011100101011010000111101000101011011000100111101011100111001011100111101111101111011000100111100111100010001011111011101000100001111000000011100110101111111011111011100101100001001000010011101010101111111011010011101011100101011010000101011110 e8ad89eb9cb9efbd89e788bee88780e6bfbee58484eabfb4eb95a1e8ad89eb9cb9efbd89e788bee88780e6bfbee58484eabfb4eb95a15e
UHC 證뜹i爾臀濾億꿴땡證뜹i爾臀濾億꿴땡^ 11110001111110111011011011100101101000111110100111101100101100111101010011101011110101011110101111100101111000101011001011101001101101101010111111110001111110111011011011100101101000111110100111101100101100111101010011101011110101011110101111100101111000101011001011101001101101101010111101011110 f1fbb6e5a3e9ecb3d4ebd5ebe5e2b2e9b6aff1fbb6e5a3e9ecb3d4ebd5ebe5e2b2e9b6af5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)