To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄μ?悠??邑る?瑤??悠у?矜伊??以 10010110111011111000001111001010001111111001011101001001001111110011111110010111010101111000001011101001001111111110101010100010001111110011111110010111010010011000010010000101001111111110000111100000100010001100100100111111001111111000100011001000 96ef83ca3f97493f3f975782e93feaa23f3f974984853fe1e088c93f3f88c8
EUC-JP 厄μ?悠??邑る?瑤??悠у?矜伊??以 11001100111100011010011011001100001111111100110110101010001111110011111111001101101110001010010011101011001111111111010010100100001111110011111111001101101010101010011111100101001111111110001011100010101100001100101100111111001111111011000011001010 ccf1a6cc3fcdaa3f3fcdb8a4eb3ff4a43f3fcdaaa7e53fe2e2b0cb3f3fb0ca
UTF-8 厄μ떜悠녶젆邑る뀆瑤뗭슧悠у슫矜伊숁만以 11100101100011101000010011001110101111001110101110010110100111001110011010000010101000001110101110000101101101101110110010100000100001101110100110000010100100011110001110000010100010111110101110000000100001101110011110010001101001001110101110010111101011011110110010001010101001111110011010000010101000001101000110000011111011001000101010101011111001111001111110011100111001001011110010001010111011001000100010000001111010111010011110001100111001001011101110100101 e58e84cebceb969ce682a0eb85b6eca086e98291e3828beb8086e791a4eb97adec8aa7e682a0d183ec8aabe79f9ce4bc8aec8881eba78ce4bba5
UHC 厄μ떜悠녶젆邑る뀆瑤뗭슧悠у슫矜伊숁만以 11100100111110001010010111101100100010111011001011101010111011011000011011100101101000001000100111101011111010011010101011101011100001011000001011101000111111011000101111101100100110101011000111101010111011011010110011100101100110101011010011010000111010001110110010100101100110011110011010111000101110001110110010100100 e4f8a5ec8bb2eaed86e5a089ebe9aaeb8582e8fd8bec9ab1eaedace59ab4d0e8eca599e6b8b8eca4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)