To what bit string is a character (or a string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 豎夥さ竏・隱作ィ凩豎夥さ竏・隱作ィ几^ 111001101011000110011010111011001000001010110011111000101000100010100101111010001010101010001101111011001010100010011001011111011110011010110001100110101110110010000010101100111110001010001000101001011110100010101010100011011110110010101000100110010111101101011110 e6b19aec82b3e288a5e8aa8deca8997de6b19aec82b3e288a5e8aa8deca8997b5e
EUC-JP 豎夥さ竏・隱作ィ凩豎夥さ竏・隱作ィ几^ 11101100101100111101010011101110101001001011010111100011111010001000111010100101111100001010110010111010111011101000111010101000110100011101111011101100101100111101010011101110101001001011010111100011111010001000111010100101111100001010110010111010111011101000111010101000110100011101110001011110 ecb3d4eea4b5e3e88ea5f0acbaee8ea8d1deecb3d4eea4b5e3e88ea5f0acbaee8ea8d1dc5e
UTF-8 豎夥さ竏・隱作ィ凩豎夥さ竏・隱作ィ几^ 11101000101100011000111011100101101001001010010111100011100000011001010111100111101010111000111111101111101111011010010111101001100110101011000111100100101111011001110011101111101111011010100011100101100001111010100111101000101100011000111011100101101001001010010111100011100000011001010111100111101010111000111111101111101111011010010111101001100110101011000111100100101111011001110011101111101111011010100011100101100001111010000001011110 e8b18ee5a4a5e38195e7ab8fefbda5e99ab1e4bd9cefbda8e587a9e8b18ee5a4a5e38195e7ab8fefbda5e99ab1e4bd9cefbda8e587a05e
UHC ??さ??隱作????さ??隱作??^ 00111111001111111010101010110101001111110011111111101011110111111110110111000010001111110011111100111111001111111010101010110101001111110011111111101011110111111110110111000010001111110011111101011110 3f3faab53f3febdfedc23f3f3f3faab53f3febdfedc23f3f5e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
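
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name `to_bit_strings` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. It also assumes that characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def to_bit_strings(text, charsets=CHARSETS):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in to_bit_strings("さ^"):
        print(name, bits, hex_string)
```

For the input `さ^`, the UTF-8 row of this sketch yields the hexadecimal string `e381955e`; the same `e38195` and trailing `5e` byte sequences appear in the UTF-8 row of the table above.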