To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????Cx}??????????Cx{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110100001101111000011111010011111100111111001111110011111100111111001111110011111100111111001111110011111101000011011110000111101101011110 3f3f3f3f3f3f3f3f3f3f43787d3f3f3f3f3f3f3f3f3f3f43787b5e
SJIS-WIN 日??逸??壹?暲?Cx}日??逸??壹?暲?Cx{^ 1001001111111010001111110011111110001000111011010011111100111111100110101110001100111111111110101101110000111111010000110111100001111101100100111111101000111111001111111000100011101101001111110011111110011010111000110011111111111010110111000011111101000011011110000111101101011110 93fa3f3f88ed3f3f9ae33ffadc3f43787d93fa3f3f88ed3f3f9ae33ffadc3f43787b5e
EUC-JP 日??逸??壹?暲?Cx}日??逸??壹?暲?Cx{^ 11000110111111000011111100111111101100001110111100111111001111111101010011100101001111111000111111000010110110110011111101000011011110000111110111000110111111000011111100111111101100001110111100111111001111111101010011100101001111111000111111000010110110110011111101000011011110000111101101011110 c6fc3f3fb0ef3f3fd4e53f8fc2db3f43787dc6fc3f3fb0ef3f3fd4e53f8fc2db3f43787b5e
UTF-8 日쇤길逸곈깅壹렱暲렲Cx}日쇤길逸곈깅壹렱暲렲Cx{^ 11100110100101111010010111101100100001111010010011101010101110001011100011101001100000001011100011101010101100111000100011101010101110011000010111100101101000111011100111101011101000001011000111100110100110101011001011101011101000001011001001000011011110000111110111100110100101111010010111101100100001111010010011101010101110001011100011101001100000001011100011101010101100111000100011101010101110011000010111100101101000111011100111101011101000001011000111100110100110101011001011101011101000001011001001000011011110000111101101011110 e697a5ec87a4eab8b8e980b8eab388eab985e5a3b9eba0b1e69ab2eba0b243787de697a5ec87a4eab8b8e980b8eab388eab985e5a3b9eba0b1e69ab2eba0b243787b5e
UHC 日쇤길逸곈깅壹렱暲렲Cx}日쇤길逸곈깅壹렱暲렲Cx{^ 1110110011101101101111001110100110110001111001101110110011101111101100001110100110110001111010111110110011101100100011101011111011101101111001111000111010111111010000110111100001111101111011001110110110111100111010011011000111100110111011001110111110110000111010011011000111101011111011001110110010001110101111101110110111100111100011101011111101000011011110000111101101011110 ecedbce9b1e6ecefb0e9b1ebecec8ebeede78ebf43787decedbce9b1e6ecefb0e9b1ebecec8ebeede78ebf43787b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)