What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???節??要??軟??堰??節??與??B 00111111001111110011111110010000110111110011111100111111100101110111011000111111001111111001001111101110001111110011111110001001100000010011111100111111100100001101111100111111001111111110010001101111001111110011111101000010 3f3f3f90df3f3f97763f3f93ee3f3f89813f3f90df3f3fe46f3f3f42
EUC-JP ???節??要??軟??堰??節??與??B 00111111001111110011111111000000111000010011111100111111110011011101011100111111001111111100011011110000001111110011111110110001111000010011111100111111110000001110000100111111001111111110011111010000001111110011111101000010 3f3f3fc0e13f3fcdd73f3fc6f03f3fb1e13f3fc0e13f3fe7d03f3f42
UTF-8 樂띷뒿節쏙슥要쎿쮵軟숅썚堰듸슬節삯찛與믥뀢B 11101111101001101011111111101011100111011011011111101011100100101011111111100111101011111000000011101100100011111001100111101100100010101010010111101000101001101000000111101100100011101011111111101100101011101011010111101000101110111001111111101100100010001000010111101100100011011001101011100101101000001011000011101011100100111011100011101100100010101010110011100111101011111000000011101100100000101010111111101100101100001001101111101000100010001000011111101011101011111010010111101011100000001010001001000010 efa6bfeb9db7eb92bfe7af80ec8f99ec8aa5e8a681ec8ebfecaeb5e8bb9fec8885ec8d9ae5a0b0eb93b8ec8aace7af80ec82afecb09be88887ebafa5eb80a242
UHC 樂띷뒿節쏙슥要쎿쮵軟숅썚堰듸슬節삯찛與믥뀢B 11101000111110011000110111100110100010101011010111101111101111011011110111101111101111011011101111101001101010011001101111100110101010001001001011100110111000111001100111101001100110111000110111100101111010001011010111101111101111011011110111101111101111011011101111101001101010011001101111100110101010001001001011100111100001011001100101000010 e8f98de68ab5efbdbdefbdbbe9a99be6a892e6e399e99b8de5e8b5efbdbdefbdbbe9a99be6a892e7859942

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
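
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation: the codec names in `CODECS` and the helper `to_bit_strings` are assumptions made for illustration, and replacing unencodable characters with "?" (byte 3f) is a guess based on the 3f bytes in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# (cp932 is Python's name for SJIS-Win; cp949 is its name for UHC.)
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the codec cannot represent are replaced with "?",
    which encodes to the single byte 0x3f in all of the codecs above.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one line per charset: label, binary bit string, hex bit string.
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("要B", codec)
        print(label, binary, hexadecimal)
```

For the input "要B", this sketch yields 3f42 for ISO-8859-1 (the kanji is replaced by "?"), 977642 for SJIS-WIN, cdd742 for EUC-JP, and e8a68142 for UTF-8, matching the byte sequences for 要 and B that appear in the table rows above.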